Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

In practice, it depends on the dataset size and use case. For web search? Mostly worthless, but can be a valuable signal to train ML. For small corpus of documents? BM25 alone does a pretty good job in general.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: