Re: Re-approaching package tagging

guix-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Re-approaching package tagging

From:	swedebugia
Subject:	Re: Re-approaching package tagging
Date:	Wed, 19 Dec 2018 08:42:24 +0100

On 2018-12-19 07:51, swedebugia wrote:

On 2018-12-18 08:48, Catonano wrote:
Il giorno lun 17 dic 2018 alle ore 22:10 swedebugia<address@hidden <mailto:address@hidden>> ha scritto:
    Hi :)

    On 2018-12-17 20:01, Christopher Lemmer Webber wrote:
     > Hello,
     >
     > In the past when we've discussed package tagging, I think Ludo'
    has been
> against it, primarily because it's a giant source ofbikeshedding. I
     > agree that it's a huge space for bikeshedding... no space
    provides more
> bikeshedding than naming things, and tagging things is a manyto many
     > naming system.
     >
     > However, I will say that finding packages based on topical
    interest is
     > pretty hard right now.  If I want to find all the available
    roguelikes:
     >
     > address@hidden:~$ guix package -A rogue
     > hyperrogue    10.5    out     gnu/packages/games.scm:3652:2
> roguebox-adventures 2.2.1 out gnu/packages/games.scm:1047:2
     >
     > Hm, that's strange, there's definitely more roguelikes that
    should show
     > up than that!  A more specific search is even worse:
     >
     > address@hidden:~$ guix package -A roguelike
     > address@hidden:~$
     >
     > What I should have gotten back:
     >   - angband
     >   - cataclysm-dda
     >   - crawl
     >   - crawl-tiles
     >   - hyperrogue
     >   - nethack
     >   - roguebox-adventures
     >   - tome4
     >
     > So I only got 1/4 of the entries I was interested in in my first
    query.
     > Too bad!
     >
> I get that we're opening up space for bikeshedding and *that'strue*.
     > But it seems like not doing so makes things hard on users.
     >
     > What do you think?  Is there a way to open the (pandora's?) box
    of tags
     > safely?

    Yes and no.

    Pjotr and I have discussed this relating to biotech software. He said
    that many scientists have a hard time finding the right tools for
    the job.

    I proposed tight integration with wikidata[1] (every software in the
    world will eventually have an item there) and Guix (QID on every
    package
    and lookup/catogory integration) and leave all the categorizing to
    them.
    Ha problem sidestepped, they are bikeshedding experts over there in
    wikiland! :D

    The advantage of this is that everyone using wikidata (every package
manager) could pull the same categorization so we only do it oncein a
    central

    What do you think?

    --


There is also the Free Software Directory
https://directory.fsf.org/wiki/Main_Page

I don't know what the relationship between Wikidata and the FSD is

Does Wikidata import data from the FSD ? Or viceversa ?
I don't know. For now at least they keep reference to the FSD onsoftware-entries that exists in the FSD.
We could integrate the FSD also but I have yet to investigate if theyprovide an API for their entries.
Anyways I view FSD as a subset of Wikidata/Wikipedia. Wikidata is thenode and FSD the leaf. Wikidata/Wikipedia will probably within a fewyears contain the data or links to the data that now exists in the FSD.
Correct me if I'm wrong but the only advantage of FSD over Wikidata &Wikipedia is that they do not include references to proprietary softwareat all.
In my view it is more feasible to compile the information on in astructured way in central node and then pull the relevant bits to the leaf.
E.g. FSD of the future could be generated from all wikidata-entries andextracts of wikipedia that are an instance ofhttps://www.wikidata.org/wiki/Q341. This would avoid fragmentation andhelp concentrate on building a large shared collective source of allknowledge within the wiki-community. FSD could exist anyhow and surelyhelp enrich the upstream data.
Similarly we could generate a wikipedia subset without any entriespointing to (evil) private corporations (any entries that is part ofhttps://www.wikidata.org/wiki/Q5621421 or whatever). I can't imaginewhat this would be good for but it its possible.
I cannot imagine that the information in FSD would not be accepted inany of the wikimedia projects. I could be wrong though as I honestly didnot visit or study the FSD very much.

Also the license of the FSD (GFDL 1.2) differs from both Wikidata (CC0)and Wikipedia (CC-BY-SA 4.0 + GFDL 1.2).


This is not to their advantage in the long run.

I fear the FSD is already becoming unmaintained and obsolete with peoplefavoring more open and smarter solutions from the wikimedia-projects (atleast I am).

When it comes to completeness we have at least 500.000 packages missingin both Wikidata and FSD (450.000+ MIT & CC0 licensed npm packages).Would any of you like to import those twice? I don't and as I see itWikidata is far superior in multiple ways to get the job done and do itwell with a big community backing it up with tools, bots, manual edits,et all. Who wants to update with new versions in two places when we haveover half a million free software packages to juggle?


Here is a small comparison example:

Top 8 JS packages according tohttps://github.com/search?o=desc&q=js&s=stars&type=Repositories(900.000+ repositories in total!) (i filtered out a few non softwares)


1. angular.js
https://www.wikidata.org/wiki/Q28925578
https://directory.fsf.org/wiki/Angular2
2. node
https://directory.fsf.org/wiki/Node#tab=Overview
https://www.wikidata.org/wiki/Q756100
3. axios
not found in either
4. three.js
https://www.wikidata.org/wiki/Q3525922
https://directory.fsf.org/wiki/Three.js
5. socket.io
https://www.wikidata.org/wiki/Q7552998
not found (poor search function in my view)
6. reveal.js
not found
not found
7. chart.js
not found
not found
8. json-server
not found
not found

Wikidata already contains way more entries and data on the entries Icompared (e.g. node, npm, gcc) than FSD despite it being a much youngerproject.


--
Cheers Swedebugia

[Prev in Thread]

Current Thread

[Next in Thread]

Re-approaching package tagging, Christopher Lemmer Webber, 2018/12/17
- Re: Re-approaching package tagging, swedebugia, 2018/12/17
  - Re: Re-approaching package tagging, zimoun, 2018/12/17
  - Re: Re-approaching package tagging, Catonano, 2018/12/18
    - Re: Re-approaching package tagging, Catonano, 2018/12/18
    - Re: Re-approaching package tagging, swedebugia, 2018/12/19
    - Re: Re-approaching package tagging, swedebugia <=
- Re: Re-approaching package tagging, Ludovic Courtès, 2018/12/18
  - Re: Re-approaching package tagging, Christopher Lemmer Webber, 2018/12/18
    - Re: Re-approaching package tagging, zimoun, 2018/12/18
    - Re: Re-approaching package tagging, Ludovic Courtès, 2018/12/19
    - Re: Re-approaching package tagging, zimoun, 2018/12/19
    - Re: Re-approaching package tagging, zimoun, 2018/12/19
    - Re: Re-approaching package tagging, Ludovic Courtès, 2018/12/20
    - Re: Re-approaching package tagging, Chris Marusich, 2018/12/20
    - Re: Re-approaching package tagging, Ludovic Courtès, 2018/12/21
    - Re: Re-approaching package tagging, Christopher Lemmer Webber, 2018/12/20

Prev by Date: Re: Re-approaching package tagging
Next by Date: Re: Packaging Terraform, a Golang package
Previous by thread: Re: Re-approaching package tagging
Next by thread: Re: Re-approaching package tagging
Index(es):
- Date
- Thread