Re: Open source fuzzy project



Lee,

In your wiki, http://dedupe.sourceforge.net/, you speak about
"Fuzzy String Matching". This functionality is often achieved
without using any technique based on fuzzy logic. One should
really speak of "approximate string matching".

Unfortunately, some people use the terms
"fuzzy string matching" and "fuzzy string search" without
even thinking about fuzzy logic. If you read
http://en.wikipedia.org/wiki/Fuzzy_string_searching you can find
"Fuzzy string searching is the name for a category of techniques for
finding strings that approximately match some given pattern string."

Fine, except that I do not think they use any real fuzzy algorithm.
They actually cite some great papers and a book, but as far as I know,
none of them uses fuzzy techniques. I have actually spent some time
looking for an implementation of fuzzy string matching that is based on
fuzzy logic, but I have not been able to come up with anything
decent.

As for approximate string matching, Google is great. I search for
"fuzy loiic" and it answers:
Did you mean: "fuzzy logic"?

I do not know if they use "fuzzy logic" or some algorithm based on
Levenshtein distance. However, this level of "understanding of
human behaviour" is great.
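
To make the Levenshtein idea concrete, here is a minimal sketch in Python
of how a "Did you mean" suggestion could be built on edit distance alone,
with no fuzzy logic involved. This is only an illustration: I do not know
what Google actually uses, and the names levenshtein, did_you_mean,
vocabulary and max_distance are my own assumptions, not part of any
library.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions and
    substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # keep the row we store as short as possible
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca by cb (or keep it)
            ))
        previous = current
    return previous[-1]


def did_you_mean(word, vocabulary, max_distance=2):
    """Return the closest vocabulary entry, or None if nothing is
    within max_distance edits (hypothetical threshold)."""
    best = min(vocabulary, key=lambda entry: levenshtein(word, entry))
    return best if levenshtein(word, best) <= max_distance else None


if __name__ == "__main__":
    vocabulary = ["fuzzy", "logic", "matching"]  # toy word list
    query = "fuzy loiic"
    print(" ".join(did_you_mean(w, vocabulary) or w for w in query.split()))
```

Each misspelled word is one edit away from its correction here, so a
small threshold is enough. A real search engine almost certainly also
uses query statistics, which this sketch ignores.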

Best regards

Bruno Di Stefano


ltickett@xxxxxxxxx wrote:
Afternoon, Morning, Evening to you all,

I've just stumbled upon this group and begun reading and have found
some really interesting info/discussions!

I have recently begun an open source project "dedupe" on sourceforge (
http://sourceforge.net/projects/dedupe ) and have set up a wiki (
http://dedupe.sourceforge.net ) to collate information and discuss the
project progress, direction etc!

If anyone is interested (just to look) or feels they can contribute
please get in touch!

Thanks

L
ltickett@xxxxxxxxx
