**yan** @svmihar@qoto.org · 2020-07-22T09:17:26Z

it's too bad the DPR datasets are TREC, which means it's still supervised learning :( hoping for something like the beam search in google's meena (the awesome open-domain chatbot)

https://arxiv.org/pdf/2004.04906.pdf

**yan** @svmihar@qoto.org · Jul 22, 2020, 09:21

and it uses FAISS, the infamous nmap indexer.

approximate vector search (the nearest-neighbor kind, like annoy/hnsw/milvus), especially on <10k docs, is actually shitty: it'll return random shit, way out of the query context, no matter which word or sentence embedding is used.
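
for a corpus that small, one alternative is to skip the approximate index and score every document exactly. a minimal sketch of that idea, stdlib only; the function names (`normalize`, `exact_top_k`) and the toy 3-dimensional vectors are made up for illustration and are not from DPR or FAISS:

```python
import heapq
import math


def normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def exact_top_k(query, doc_vectors, k=3):
    """Return (doc_index, cosine_similarity) pairs for the k closest documents.

    Every document is scored, so there is no approximation error.
    """
    q = normalize(query)
    scores = (
        (sum(a * b for a, b in zip(q, normalize(doc))), idx)
        for idx, doc in enumerate(doc_vectors)
    )
    best = heapq.nlargest(k, scores)
    return [(idx, score) for score, idx in best]


# Toy "embeddings" (hypothetical values, only to show the call).
docs = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
results = exact_top_k([1.0, 0.0, 0.0], docs, k=2)
print(results)
```

this is linear in the number of docs per query, which is usually fine below 10k; whether it actually beats a tuned approximate index on a given dataset is something to measure, not assume.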
