help-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: local database types for personal use


From: Alan E. Davis
Subject: Re: local database types for personal use
Date: Sun, 22 Jan 2012 07:51:16 -0800

Did you find a format amenable to your needs?  

I had intended to send this short example of band format some months ago.    Ban format is perfect for what you are doing.  You can write simple utility routines to search or sort the database. 

Two entries in  band format follow.  A record does not need to occupy on line.  "..hw" always begins a new record:

..hw lun kaeja  .gs   .ge Jellyfish (secondary etc names describe the specific type)  .ec Jellyfish  .cg cnidaria  .la Mokil  .src  .n Father of a student   .dt [2010-08-27 Fri]

..hw lagong  .gs   .ge Big, green fly  .ec   .cg Insect  .la Cebuano  .src FSD .n came up as an exception to the general name for fly.   .dt [2010-06-11 Fri]

.. begins a record
..hw  headword tag
.gs  Scientific Name
.ge  English gloss
.ec  English common name
.cg  catagory
.la  language
..src  Source
.dt  date

I wrote some little macros / routines that converted a record into a typeset dictionary in LaTeX.  A frontend, as it were.  Each tag is typeset in a distinct font.  Robert Hsu who was at the University of Hawaii linguistics department in the 90s, wrote a spitbol/snobol4 package for manipulating data in band format, used in producing several dictionaries of several pacific languages.   Others have developed band formats distinct from this one. 

Alan


On Mon, Dec 26, 2011 at 11:08 AM, Alan E. Davis <lngndvs@gmail.com> wrote:
Band format is a free-form format used in linguistics for lexical data.  

The following is  from a google search.

http://billposer.org/Linguistics/Computation/LectureNotes/ParsingLexica.html

I have used my own version of this format for a lexicon.  If you wish, I can spend more time explaining how ti did it.

Basically, each "field" is marked by a dot and one or more letters.  Two dots mark the beginning of a record.

..hw  means "headword"  (use anything you want)
.d    could mean date
.ge  could mean "english gloss"
.gd  could mean "dutch gloss"
.p    could mean part of speech.

Two spaces preceed each .X 

New categories can be made up on the fly.

Alan Davis


On Sun, Dec 25, 2011 at 5:57 PM, Thien-Thi Nguyen <ttn@gnuvola.org> wrote:
() Soichi Ishida <soujiro0725@gmail.com>
() Mon, 26 Dec 2011 10:47:58 +0900

  I have totally forgotten EBD :)

  What if I have more than one table?

Then your forgetfulness is ideal.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]