Noodling About - Experiments, Writing & More

Experimenting with GenAI, LLMs & RAG - Part 2: When Are LLMs Useful?

Introduction

So in Part One: Experimenting with GenAI, LLMs & RAG - Part 1: Building a RAG Application, I touched upon what "Retrieval-Augmented Generation (RAG)" is, and discussed a little RAG application that I'd built: see https://dnalorroland.pythonanywhere.com/ns/ftse?tab=qa

Does RAG Help Overcome the Limitations of LLMs?

In short:

Who is Oxfam's current chief executive?

Was There More I Could Do?

Well, it seems that there probably were more things I could try - see "DiamantAI: How to Stop AI Hallucinations: 10 Proven Techniques That Actually Work". That's really at least ten techniques.

How to Stop AI Hallucinations: 10 Proven Techniques That Actually Work

Interestingly, I was already doing some of these, namely:

The Real Problem: This Is Not a Good Use Case for LLMs

I think the reason I haven't persisted with this any further is because being precise fundamentally isn't what LLMs are good at. In reality when you ask an LLM a question the core engine can give you thousands of answers in decreasing levels of relevance to your query, but that doesn't make a good product.

If we ask a chatbot for a summary, we want certainty "hey, no problem, here's your summary". We don't want "here's what I think is the most relevant summary, but you should check it" - particularly if we're summarising a 200 page PDF. So this certainty is what the platform providers have built us, but that presentation does seem somewhat disengenous.

LLMs are by their nature probablistic, and so any answer is always just one of an infinite number of answers determined by the LLM to be the most relevant to your query.

Which does make me a little nervous about this:

Musk's Grok signs $200m deal with Pentagon days after antisemitism row

Still, I'm sure they'll work it out in high-pressure military situations where certainty is rather important 🫣

What Are Good Use Cases for LLM?

So this begs the question - what are LLM's useful for?

Returning to My Original Intention:

"Could one get an LLM to give an accurate analysis of a company account, and explain why it gave that analysis?"

Given this, I've decided not to take this project much further.

Further Reading:

I find Gary Marcus well worth reading. Here's a good example:

Game over for pure LLMs. Even Turing Award Winner Rich Sutton has gotten off the bus.

Finally, via my brother Joe, a terrific example of a Linked In Recruitment agent being tricked into generating a flan recipe in its speculative email having scraped a profile:

Recruitment Agent generated a flan recipe

What's Next?

Previous Post | Next Post