Re: [Help-gnunet] finding files & database management

help-gnunet

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Help-gnunet] finding files & database management

From:	Krista Bennett
Subject:	Re: [Help-gnunet] finding files & database management
Date:	Fri, 12 Mar 2004 13:55:03 -0500
User-agent:	Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.7a) Gecko/20040219

Hopefully I can answer most of this; I spend a lot of my time out of theloop (and so sometimes the clever folk change parts of the scheme onme), but I think I can give you some sense of what the answer is and whywith my usual annoying verbosity.


Benjamin Kay wrote:

With very little content currently on GNUnet, finding files isn't easy.

This is true, and has long been known to be an issue; however, withoutmany users, we don't have much content. Now that GNUnet is increasinglystable, and with windows port action going on, that may change in thefuture.

To complicate things, keyword matching in a search seems to be explicit and casesensitive.

This is also true; while we could certainly add an option to the searchutility to have it look for a keyword in various case configurations,eliminating the case sensitivity in the encoding scheme itself would bea problem; since we look for keywords and hash-key-indexed content inthe same way, this explicitness is simply part of how things work.

Doesn't mean we couldn't add something to the insert utility toautomatically add stuff using various cases though!

To make files I insert/index easier to find, I try to include asmany relevant keywords as possible - but inevitably, I still think of a fewadditional keywords after I've inserted/indexed the file. The same goes forfile descriptions. I know I can reinsert the file with the new keywords anddescriptions, but that is costly in terms of processing time and requiresmeticulous record keeping on my part (I need to keep track of under whatdescription and keywords the original file was inserted).

Well... I suppose it's possible to automate some sort of external recordof what you've inserted under various keywords and have it point to thetop block of the file so that you could continuously reindex that blockwith different keywords. That might be something useful to have.

That's really so hard, methinks; as an aside, the problem with doingthat is that for the person using such a method, there is then aconcrete record of content you've inserted. From a "plausibledeniability" standpoint, you then open yourself up to trouble, asthere's not only a concrete record of what you've inserted into thenetwork, but a pointer to the file itself - if I insert something Idon't want attributed to me (for example, my dissertation drafts :),it's probably not smart for me to intentionally retain a record. Itdoesn't hurt the network, just me, but it's just something to think about.

This isn't a problem for the network in any sense, and I suppose it's nodifferent than you keeping track of such stuff on your own.

So the short question and answer is: could something be incorporated sothat you could add additionally descriptions and keywords to an existingtop block without a complete reinsertion/reindexing? Unless there's beensome radical changes in the encoding scheme since the last time I lookedat it, sure, I think it's possible (as long as the previouslyindexed/inserted top content block is still around or can beconstructed). Christian, Igor, Nils, and company will correct me if I'mwrong, I'm sure.

Is there a way tomodify the description and/or keywords of an inserted or indexed file withoutreinsertion?

As I said above, unless I'm forgetting something vital, it could be madepossible to add to the keyword list given a reference to the top block.

Now, to actually "modify" the description, that's a bit more of aproblem, and that has something to do with the censorship-resistantnature of the network. If I insert the same file 100 times under thesame keyword, given that the filename of the highest block in thecontent tree is a function of the keyword, it should overwrite my localcopy of that top block.

(Is that right Christian, or did you and Igor do something tricky andnew I'm forgetting about?)

So in that case, you can "modify" the description by reinserting the topblock with the same keyword as before but a different description aslong as the only copy of that block is on your machine.

On the other hand, if that keyword block has migrated for any reason,you can't do a darned thing about the already existing keyword blocksthat are out on the network. Nor should you be able to; if I insert mydissertation under the keywords "Kristas_dissertation" and thedescription "Draft copy of dissertation on stuff and things - do nottake internally without consulting a physician", I don't particularlywant anyone else to go through the network and change the description to"Important government dossier on weapons of mass destruction - use ingovernment press briefings" for every single keyword block out there. Soonce it's out in the network, it stays there until more importantcontent comes along and it fades away.

So, again, the short answer is that it could be done by just"reinserting" the top block alone with a new keyword or description, butif that top block has migrated, you're stuck with the two versions inthe network.

How about a way to view the descriptions and keywords of indexed files?

Unless you keep this externally (i.e. you keep a record of every keywordyou've indexed) somehow, no; again, this is intentional. Part of whatmakes the AFS portion of GNUnet work is that you retain plausibledeniability; this means that if someone goes to your machine and says"hey, we're going to confiscate your machine, search it, and destroy itbecause you have nude pictures of Dick Cheney on it", you can honestlysay you had no way of knowing they were there short of brute forcesearching for nude pictures of Dick Cheney. (??!??!!)

Furthermore, all of the blocks you've got stored look the same toGNUnet, keyword-indexed or not, so unless you have the keywordssomewhere, you can't do a reverse-lookup.

Along the same lines, is there a way to reindex a downloaded file without,well... reindexing it? I'm guessing that on nodes with content migrationenabled, downloaded content gets inserted into the migration database.Perhaps there is a way to make it permanent on that node (index it) withoutwasting all that time manually reindexing it? And is it possible to reindexsuch files under their original descriptions and keywords?

Hrm... I'll let Christian handle that one. I'm probably just not parsingyour question the way you intend it :)

I've probably confused you more than I've helped, but the casesensitivity thing and having some sort of external indexing utilitysound like plausible (and fairly easy-to-implement) features to me. IfI'm feeling ambitious this afternoon, maybe I'll even do something about it.


- Krista

--
***********************************************************************
Krista Bennett                             web.ics.purdue.edu/~bennetkl
Graduate Student in Linguistics              address@hidden
Purdue University
**                                                                   **
If you think education is expensive, try ignorance. - Benjamin Franklin
**                                                                   **

[Prev in Thread]

Current Thread

[Next in Thread]

[Help-gnunet] finding files & database management, Benjamin Kay, 2004/03/12
- Re: [Help-gnunet] finding files & database management, Krista Bennett <=
  - Re: [Help-gnunet] finding files & database management, Krista Bennett, 2004/03/12
- Re: [Help-gnunet] finding files & database management, Markku Tavasti, 2004/03/12
  - Re: [Help-gnunet] finding files & database management, Igor Wronsky, 2004/03/13
- Re: [Help-gnunet] finding files & database management, Igor Wronsky, 2004/03/13
- Fwd: Re: [Help-gnunet] finding files & database management, Benjamin Kay, 2004/03/12
  - Re: Fwd: Re: [Help-gnunet] finding files & database management, Krista Bennett, 2004/03/13
    - Re: Fwd: Re: [Help-gnunet] finding files & database management, Benjamin Kay, 2004/03/13
    - Re: Fwd: Re: [Help-gnunet] finding files & database management, Krista Bennett, 2004/03/17

Prev by Date: [Help-gnunet] finding files & database management
Next by Date: Fwd: Re: [Help-gnunet] finding files & database management
Previous by thread: [Help-gnunet] finding files & database management
Next by thread: Re: [Help-gnunet] finding files & database management
Index(es):
- Date
- Thread