https://eugeneyan.com/writing/llm-experiments/

Question

https://eugeneyan.com/writing/llm-experiments/

utterances-bot opened this issue a year ago · comments

utterances bot commented a year ago

Experimenting with LLMs to Research, Reflect, and Plan

Also, shortcomings in document retrieval and how to overcome them with search & recsys techniques.

https://eugeneyan.com/writing/llm-experiments/

Ian Danforth · Answer 1 · Tue Apr 11 2023 23:08:29 GMT+0800 (China Standard Time)

"If we use exact nearest neighbours, we would get perfect recall of 1.0 but with higher latency (think seconds)."

Exact search is performant up to tens of thousands of documents / vectors. Is the document store you're embedding really that large?

Eugene Yan · Answer 2 · Wed Apr 12 2023 12:37:55 GMT+0800 (China Standard Time)

Not currently. Nonetheless, this is meant to scale to larger and far more documents, such as books and papers (idea here).

Thomas Ebermann · Answer 3 · Thu Apr 13 2023 17:12:10 GMT+0800 (China Standard Time)

where is the source code for that?

Kevin Leneway · Answer 4 · Thu Apr 13 2023 20:21:12 GMT+0800 (China Standard Time)

Great read, nice work on this. Thanks for highlighting the issues with retrieval. I’ve recently started building an LLM-powered assistant type app and was surprised to see how difficult the retrieval step is. There are lots of blog posts out there about how to set up a vector search tool but very few about how to optimize and troubleshoot queries and embeddings.

Eugene Yan · Answer 5 · Tue Apr 18 2023 05:51:31 GMT+0800 (China Standard Time)

where is the source code for that?

Currently private. It’s a mess and I’m embarrassed lol 🙈 Also needs to be scrubbed of credentials.