Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

So this identifies keys from source and target objects that are fuzzy synonyms and copies the values over. What is a real world use case for this? Add the fact that it's fuzzy and won't always work, so would require a great deal of extra effort in QA/testing (harder than just mapping the keys programmatically), and I'm puzzled.


We do something very similar with embeddings in our product. Users import files that they have to match to a dynamically-defined target schema. The embedding matching provides suggested matches to the user that are generally very accurate, so they don't have to go through and manually match up "telephone" to "phone number" etc. It even works across languages.


I've got some similar use-cases. So, do I understand correctly that you take the source keyword and generate an embedding vector of it, then compare it using dot-product similarity or something to the embedded vectors of the target keywords?


Exactly, although we use cosine similarity.


Perfect. And yeah that's what I meant, so used to just normalizing vectors so dot product = cosine.


How much time dos this save your users? Is this QOL? Or more of a "our product wouldn't work without this feature" kind of thing?


Quite a bit of time. The product would still work without the feature, but it is a major feature. It bypasses lots of wading through dropdowns (potentially dozens for a single session)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: