Yuan XU’s Post

A trick in retrieval-augmented generation (RAG) is to use the output of RAG as extra context information to perform second RAG query to get better retrieval result and eventually get a better output. In my interpretation it is a resampling of documents in the vector storage to get more relevant documents as the result of retrieval. And I think we can do the same by querying the vector storage using the embedding output of ReRanker for better retrieval without large cost overhead using text generation. Thanks Paul Tsoi for the paper https://lnkd.in/g8NFdZBg

  • No alternative text description for this image

To view or add a comment, sign in

Explore topics