[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Fwd: Re: [Gnumed-devel] phrase wheel
From: |
Hilmar Berger |
Subject: |
Re: Fwd: Re: [Gnumed-devel] phrase wheel |
Date: |
Sun, 14 Sep 2003 19:56:27 +0200 (MEST) |
> When typing even more the phrase wheel starts matching even
> true substrings such that eventually:
>
> "pain" does match "repainted house"
Is that *really* necessary ? As far as I know substring matching inside of
words is
much more time-consuming at the backend than is matching at the start of a
term.
Maybe we should make this a special case. We could use wildcards to indicate
that
we really want to do this :
"*pain" or "*pain*" matches "repainted" , "pain" does only match on start of
term.
> Now, Richard pointed out that some characters need to be
> ignored:
>
> "?aneurysm" should loose the "?" for matching.
>
> Those are stripped.
What if these characters have a meaning ? Maybe we should try to expand
special characters like
"?aneurysm" = "possible anerysm"
> - (drop-dead painful, perhaps debatable)
> <space> (pain severe)
> <tab> (pain severe)
> = (pain=severe)
> + (pain+swelling)
> & (pain&swelling)
> : (pain:severe)
> _ (pain_severity, this is debatable)
> @ (address@hidden)
>
> Hence:
> > > default_word_separators = re.compile('[- \t=+&:address@hidden')
Same problem with special characters - maybe we should try to expand them.
> > > default_phrase_separators = re.compile('[;/|]+')
> Those are used to separate input terms to be used as fragments
> for matching. If in a lab request phrase wheel I type
>
> "ESR;CRP;DBC" or "ESR/CRP/ASL/DBC" ...
>
> I'd like the phrase wheel to deliver matches based on the
> phrase I am in, say the "CRP" part of the input.
Well, now I understand.
> > > default_ignored_chars = re.compile("""[?!."'\\(){}\[\]<>~#*$%^]+""")
> Well, those get ignored right from the beginning.
Again, we might want to expand them first.
Hilmar
--
COMPUTERBILD 15/03: Premium-e-mail-Dienste im Test
--------------------------------------------------
1. GMX TopMail - Platz 1 und Testsieger!
2. GMX ProMail - Platz 2 und Preis-Qualitätssieger!
3. Arcor - 4. web.de - 5. T-Online - 6. freenet.de - 7. daybyday - 8. e-Post
[Gnumed-devel] phrase wheel, Karsten Hilbert, 2003/09/15