|
From: | swedebugia |
Subject: | Re: Re-approaching package tagging |
Date: | Wed, 19 Dec 2018 08:42:24 +0100 |
On 2018-12-19 07:51, swedebugia wrote:
On 2018-12-18 08:48, Catonano wrote:Il giorno lun 17 dic 2018 alle ore 22:10 swedebugia <address@hidden <mailto:address@hidden>> ha scritto:Hi :) On 2018-12-17 20:01, Christopher Lemmer Webber wrote: > Hello, > > In the past when we've discussed package tagging, I think Ludo' has been> against it, primarily because it's a giant source of bikeshedding. I> agree that it's a huge space for bikeshedding... no space provides more> bikeshedding than naming things, and tagging things is a many to many> naming system. > > However, I will say that finding packages based on topical interest is > pretty hard right now. If I want to find all the available roguelikes: > > address@hidden:~$ guix package -A rogue > hyperrogue 10.5 out gnu/packages/games.scm:3652:2> roguebox-adventures 2.2.1 out gnu/packages/games.scm:1047:2> > Hm, that's strange, there's definitely more roguelikes that should show > up than that! A more specific search is even worse: > > address@hidden:~$ guix package -A roguelike > address@hidden:~$ > > What I should have gotten back: > - angband > - cataclysm-dda > - crawl > - crawl-tiles > - hyperrogue > - nethack > - roguebox-adventures > - tome4 > > So I only got 1/4 of the entries I was interested in in my first query. > Too bad! >> I get that we're opening up space for bikeshedding and *that's true*.> But it seems like not doing so makes things hard on users. > > What do you think? Is there a way to open the (pandora's?) box of tags > safely? Yes and no. Pjotr and I have discussed this relating to biotech software. He said that many scientists have a hard time finding the right tools for the job. I proposed tight integration with wikidata[1] (every software in the world will eventually have an item there) and Guix (QID on every package and lookup/catogory integration) and leave all the categorizing to them. Ha problem sidestepped, they are bikeshedding experts over there in wikiland! :D The advantage of this is that everyone using wikidata (every packagemanager) could pull the same categorization so we only do it once in acentral What do you think? -- There is also the Free Software Directory https://directory.fsf.org/wiki/Main_Page I don't know what the relationship between Wikidata and the FSD is Does Wikidata import data from the FSD ? Or viceversa ?I don't know. For now at least they keep reference to the FSD on software-entries that exists in the FSD.We could integrate the FSD also but I have yet to investigate if they provide an API for their entries.Anyways I view FSD as a subset of Wikidata/Wikipedia. Wikidata is the node and FSD the leaf. Wikidata/Wikipedia will probably within a few years contain the data or links to the data that now exists in the FSD.Correct me if I'm wrong but the only advantage of FSD over Wikidata & Wikipedia is that they do not include references to proprietary software at all.In my view it is more feasible to compile the information on in a structured way in central node and then pull the relevant bits to the leaf.E.g. FSD of the future could be generated from all wikidata-entries and extracts of wikipedia that are an instance of https://www.wikidata.org/wiki/Q341. This would avoid fragmentation and help concentrate on building a large shared collective source of all knowledge within the wiki-community. FSD could exist anyhow and surely help enrich the upstream data.Similarly we could generate a wikipedia subset without any entries pointing to (evil) private corporations (any entries that is part of https://www.wikidata.org/wiki/Q5621421 or whatever). I can't imagine what this would be good for but it its possible.I cannot imagine that the information in FSD would not be accepted in any of the wikimedia projects. I could be wrong though as I honestly did not visit or study the FSD very much.
Also the license of the FSD (GFDL 1.2) differs from both Wikidata (CC0) and Wikipedia (CC-BY-SA 4.0 + GFDL 1.2).
This is not to their advantage in the long run.I fear the FSD is already becoming unmaintained and obsolete with people favoring more open and smarter solutions from the wikimedia-projects (at least I am).
When it comes to completeness we have at least 500.000 packages missing in both Wikidata and FSD (450.000+ MIT & CC0 licensed npm packages). Would any of you like to import those twice? I don't and as I see it Wikidata is far superior in multiple ways to get the job done and do it well with a big community backing it up with tools, bots, manual edits, et all. Who wants to update with new versions in two places when we have over half a million free software packages to juggle?
Here is a small comparison example:Top 8 JS packages according to https://github.com/search?o=desc&q=js&s=stars&type=Repositories (900.000+ repositories in total!) (i filtered out a few non softwares)
1. angular.js https://www.wikidata.org/wiki/Q28925578 https://directory.fsf.org/wiki/Angular2 2. node https://directory.fsf.org/wiki/Node#tab=Overview https://www.wikidata.org/wiki/Q756100 3. axios not found in either 4. three.js https://www.wikidata.org/wiki/Q3525922 https://directory.fsf.org/wiki/Three.js 5. socket.io https://www.wikidata.org/wiki/Q7552998 not found (poor search function in my view) 6. reveal.js not found not found 7. chart.js not found not found 8. json-server not found not foundWikidata already contains way more entries and data on the entries I compared (e.g. node, npm, gcc) than FSD despite it being a much younger project.
-- Cheers Swedebugia
[Prev in Thread] | Current Thread | [Next in Thread] |