How it works in practice
Google CloudRAG (Retrieval-Augmented Generation) is the most common grounding technique. Before the LLM answers a question, a search system retrieves relevant PDF snippets or database rows. The LLM is then instructed: "Answer the user's question using ONLY the following retrieved snippets."