Yuan XU’s Post

Job Hunter 😂

7mo

A trick in retrieval-augmented generation (RAG) is to use the output of RAG as extra context information to perform second RAG query to get better retrieval result and eventually get a better output. In my interpretation it is a resampling of documents in the vector storage to get more relevant documents as the result of retrieval. And I think we can do the same by querying the vector storage using the embedding output of ReRanker for better retrieval without large cost overhead using text generation. Thanks Paul Tsoi for the paper https://lnkd.in/g8NFdZBg

To view or add a comment, sign in

More Relevant Posts

Hamid Reza Zamanian [UNV★MBA★PMP]

📊📈 BI Data Manager at 🌐 United Nations Development Program, USA 🇺🇲 (Remote)
8mo
Report this post
Combine Text Embeddings and Knowledge (Graph) Embeddings in RAG systems https://lnkd.in/dYXSY5EN

Combine Text Embeddings and Knowledge (Graph) Embeddings in RAG systems

towardsdatascience.com
Like Comment
To view or add a comment, sign in
Surya Putchala

Applied AI/ML Expert | I help organizations from AI Strategy & Solutioning to Execution | Generative AI Consultant | 2X Founder, 2 Exits with $40MM+ M&A valuation
8mo
Report this post
The convergence of graphs, networks, AI...how cool is it? Thanks Sunila Gollapudi for writing this nice article!!

Sunila Gollapudi

Engineering Leader, Google | Enterprise Data, Cloud & Architecture | Knowledge Scientist | Author
8mo

Here is a detailed article on how I evaluated and combined the text embeddings and knowledge (graph) embeddings and leveraged in RAGs. It has 4 parts, Part 1: What are Text embeddings (TE) & how are they stored and used in the RAG implementation? Part 2: What are Knowledge (Graph) embeddings (KGE) & How are they stored? Part 3: How are Knowledge (Graph) Embeddings different from Text Embeddings, and analyze if they are complementary in the context of usage in RAG ? Conclusion: Benefits of combining embeddings and overall summary https://lnkd.in/ggh9QUer

Combine Text Embeddings and Knowledge (Graph) Embeddings in RAG systems

medium.com
Like Comment
To view or add a comment, sign in
Waqas Ahmed

Technical Lead at DPL Pvt. Ltd
1mo
Report this post
Pretty excited about this new RAG technique 🧑🍳 A top issue with RAG chunking is it splits the document into fragmented pieces, causing top-k retrieval to return partial context. Also most documents have multiple hierarchies of sections: top-level sections, sub-sections, etc. This is also why lots of people are interested in exploring the idea of knowledge graphs - pulling in "links" to related pages to expand retrieved context. This notebook lets you retrieve contiguous chunks without having to spend a lot of time tuning the chunking algorithm, thanks to GraphRAG-esque metadata tagging + retrieval. Tag chunks with sections, and use the section ID to expand the retrieved set. #RAGTechnique #GraphRAG #KnowledgeGraphs #AIContextualUnderstanding #InformationRetrieval #NaturalLanguageProcessing #ChunkingOptimization #ArtificialIntelligenceInnovation #MachineLearningAdvancements #LanguageModelingSolutions https://lnkd.in/gqTnfKWG

Jerry Liu (@jerryjliu0) on X

x.com
Like Comment
To view or add a comment, sign in
Weaviate

26,786 followers
5mo
Report this post
In this blog, Zain Hasan breaks down RAG into indexing, retrieval, and generation components and proposes 2 to 3 practical steps to improve each part of your RAG pipeline. Covering everything from chunking techniques, filtered search, and hybrid search to reranking, fine-tuning embedding models, and generating metadata for your text chunks! https://lnkd.in/dqfKWfiu
Like Comment
To view or add a comment, sign in
Sunila Gollapudi

Engineering Leader, Google | Enterprise Data, Cloud & Architecture | Knowledge Scientist | Author
8mo
Report this post
Here is a detailed article on how I evaluated and combined the text embeddings and knowledge (graph) embeddings and leveraged in RAGs. It has 4 parts, Part 1: What are Text embeddings (TE) & how are they stored and used in the RAG implementation? Part 2: What are Knowledge (Graph) embeddings (KGE) & How are they stored? Part 3: How are Knowledge (Graph) Embeddings different from Text Embeddings, and analyze if they are complementary in the context of usage in RAG ? Conclusion: Benefits of combining embeddings and overall summary https://lnkd.in/ggh9QUer

Combine Text Embeddings and Knowledge (Graph) Embeddings in RAG systems

medium.com

5 Comments
Like Comment
To view or add a comment, sign in
Eyal Maor

Entrepreneur | Board Member | Executive
8mo
Report this post
We use AI today as if it can solve any type of problem "out of the box". When it comes to internal data it is becoming far challenging to understand how to use AI and which technologies that can we leverage with our internal enterprise data to get much more out of it in a seamless way . See the following to get some interesting ideas

Sunila Gollapudi

Engineering Leader, Google | Enterprise Data, Cloud & Architecture | Knowledge Scientist | Author
8mo

Here is a detailed article on how I evaluated and combined the text embeddings and knowledge (graph) embeddings and leveraged in RAGs. It has 4 parts, Part 1: What are Text embeddings (TE) & how are they stored and used in the RAG implementation? Part 2: What are Knowledge (Graph) embeddings (KGE) & How are they stored? Part 3: How are Knowledge (Graph) Embeddings different from Text Embeddings, and analyze if they are complementary in the context of usage in RAG ? Conclusion: Benefits of combining embeddings and overall summary https://lnkd.in/ggh9QUer

Combine Text Embeddings and Knowledge (Graph) Embeddings in RAG systems

medium.com
Like Comment
To view or add a comment, sign in
LlamaIndex

229,977 followers
9mo
Report this post
There’s thousands of RAG techniques and tutorials, but which ones perform the best? ARAGOG by Matouš Eibich is one of the most comprehensive evaluation surveys on advanced RAG techniques, testing everything from “classic vector database” to reranking (Cohere, LLM) to MMR to LlamaIndex native advanced techniques (sentence window retrieval, document summary index). The findings 💡: ✅ HyDE and LLM reranking enhance retrieval precision ⚠️ MMR and multi-query techniques didn’t seem to be as effective ✅ Sentence window retrieval, Auto-merging retrieval, and the document summary index (all native LlamaIndex techniques) offer promising benefits in either retrieval precision and answer similarity! (And also interesting tradeoffs). It’s definitely worth giving the full paper a skim. Check it out: https://lnkd.in/genni8g2
39 Comments
Like Comment
To view or add a comment, sign in
SmartBots AI

4,883 followers
9mo
Report this post
A comprehensive study of RAG techniques adds so much value to the Generative AI solutions ecosystem ....
LlamaIndex

229,977 followers
9mo

There’s thousands of RAG techniques and tutorials, but which ones perform the best? ARAGOG by Matouš Eibich is one of the most comprehensive evaluation surveys on advanced RAG techniques, testing everything from “classic vector database” to reranking (Cohere, LLM) to MMR to LlamaIndex native advanced techniques (sentence window retrieval, document summary index). The findings 💡: ✅ HyDE and LLM reranking enhance retrieval precision ⚠️ MMR and multi-query techniques didn’t seem to be as effective ✅ Sentence window retrieval, Auto-merging retrieval, and the document summary index (all native LlamaIndex techniques) offer promising benefits in either retrieval precision and answer similarity! (And also interesting tradeoffs). It’s definitely worth giving the full paper a skim. Check it out: https://lnkd.in/genni8g2
1 Comment
Like Comment
To view or add a comment, sign in
Tafar M.

Data Scientist | AI/ML Practitioner {Specializing in AI & ML Pipelines} | Database {SQL & NoSQL Expertise} ● Predictive Maintenance & Digital Twin Technology
4mo
Report this post
Graph RAG Works Better Than Standard RAG #GraphRAG leverages structural information across entities to enable more precise and comprehensive retrieval, capturing relational knowledge and facilitating more accurate, context-aware responses. This improves the accuracy of standard RAG systems.
Like Comment
To view or add a comment, sign in
Xiangyu AN

AI Project Manager | @AFNOR Groupe & Inria | Gen AI, Conversational agent
9mo
Report this post
Well, it's true that people get some better results by applying these techniques, but for each use case, it is still important to have enough tests to find the best way for his/her best approach. The application and effect of new technologies is not really linear. Practice is the best way to prove. Thank you for sharing the paper!
LlamaIndex

229,977 followers
9mo

There’s thousands of RAG techniques and tutorials, but which ones perform the best? ARAGOG by Matouš Eibich is one of the most comprehensive evaluation surveys on advanced RAG techniques, testing everything from “classic vector database” to reranking (Cohere, LLM) to MMR to LlamaIndex native advanced techniques (sentence window retrieval, document summary index). The findings 💡: ✅ HyDE and LLM reranking enhance retrieval precision ⚠️ MMR and multi-query techniques didn’t seem to be as effective ✅ Sentence window retrieval, Auto-merging retrieval, and the document summary index (all native LlamaIndex techniques) offer promising benefits in either retrieval precision and answer similarity! (And also interesting tradeoffs). It’s definitely worth giving the full paper a skim. Check it out: https://lnkd.in/genni8g2
1 Comment
Like Comment
To view or add a comment, sign in

141 followers

View Profile Follow

Yuan XU’s Post

More from this author

Leveraging Digital Tools to Enhance Citizen Participation and Democratic Decision-Making

Fix My Street and Polis: A Powerful Partnership for Citizen Engagement

Gaza War chatbot project closure

Explore topics