This repository has been archived by the owner on Oct 3, 2022. It is now read-only.

Fuzzy tag match #59

Open
DonaldTsang opened this issue Dec 19, 2018 · 4 comments

Comments

@DonaldTsang

Matching tags with similar pronunciation or spelling
Similar to https://gitgud.io/Dizmal/borehole

@bakape
Owner

bakape commented Mar 11, 2019

Accounting for spelling mistakes would lead to too much noise. Use substring matching.

@DonaldTsang
Author

@bakape I would recommend looking into string metrics (https://en.wikipedia.org/wiki/String_metric); there are many algorithms that account for spelling mistakes. Then again, a simpler way would be to use phonetic encoding (https://en.wikipedia.org/wiki/Phonetic_encoding), which reduces complexity (assuming you know what most tags look like phonetically).
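
As a rough illustration of the string-metric idea, here is a minimal Python sketch of Levenshtein edit distance, one of the metrics covered in the linked article. The helper `closest_tags` and its `max_distance` threshold are hypothetical names chosen for illustration; nothing here is taken from this project's code.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of insertions, deletions and substitutions."""
    # Keep the shorter string in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        cur = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1,          # delete ca
                           cur[j - 1] + 1,       # insert cb
                           prev[j - 1] + cost))  # substitute or match
        prev = cur
    return prev[-1]


def closest_tags(query, tags, max_distance=2):
    """Hypothetical helper: tags within max_distance edits, nearest first."""
    scored = [(levenshtein(query, tag), tag) for tag in tags]
    return [tag for dist, tag in sorted(scored) if dist <= max_distance]
```

Note that this compares the query against every tag on each search, which is the cost a precomputed phonetic encoding would avoid.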

@bakape
Owner

bakape commented Mar 11, 2019 via email

@DonaldTsang
Author

DonaldTsang commented Mar 13, 2019

@bakape in this case, to avoid adding string metric functions, phonetic-encoded substrings would be useful. All that is required is an extra column in the tag database that holds a phonetic encoding of each tag.
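
A minimal sketch of that idea in Python with SQLite, under stated assumptions: the `tags` table, its `phonetic` column, and the `phonetic_key` function are all hypothetical, and the key is a simplified, untruncated variant of the Soundex digit classes rather than any standard encoding. The query is encoded the same way as the stored tags and then matched as a substring of the stored key.

```python
import sqlite3

# Soundex-style consonant classes. Vowels, h, w, y and non-letters are dropped.
_CLASSES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def phonetic_key(text: str) -> str:
    """Hypothetical simplified key: one digit per consonant class, no truncation."""
    out = []
    for ch in text.lower():
        digit = _CLASSES.get(ch)
        # Skip unmapped characters; collapse repeats of the same class,
        # even when dropped letters sit between them.
        if digit and (not out or out[-1] != digit):
            out.append(digit)
    return "".join(out)


def build_index(tags):
    """Create an in-memory tag table with the extra phonetic column."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE tags (tag TEXT PRIMARY KEY, phonetic TEXT NOT NULL)")
    db.executemany(
        "INSERT INTO tags VALUES (?, ?)",
        [(tag, phonetic_key(tag)) for tag in tags],
    )
    return db


def fuzzy_search(db, query):
    """Return tags whose phonetic key contains the query's phonetic key."""
    key = phonetic_key(query)
    if not key:
        return []  # nothing encodable in the query, so match nothing
    rows = db.execute(
        "SELECT tag FROM tags WHERE instr(phonetic, ?) > 0 ORDER BY tag",
        (key,),
    )
    return [row[0] for row in rows]
```

With tags such as `blonde_hair`, `black_hair` and `long_hair` indexed this way, a misspelled query like `blomd` encodes to the same digits as `blond` and so matches only `blonde_hair`. Very short queries produce very short keys and will match many tags, so a minimum key length may be worth considering.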
