Confusing Vicky (Vicuna-13B)

Technotica@lemmy.world · edit-2 1 year ago

Confusing Vicky (Vicuna-13B)

mo_ztt ✅@lemmy.world · edit-2 1 year ago

So – the behavior you saw is actually how LLMs are supposed to behave.

The core of any LLM is just predicting what word comes next. That’s it. It’ll happily have a conversation with itself, or invent more or less anything to come next, because (unless there’s a web interface that gives it one) it has no concept of “your part” of the conversation versus “its part.”

Models like InstructGPT have been constructed to massage that core functionality into something that can talk with you or obey commands (“Answer in the style of…”), but that’s not really how they operate at the core. It’s a hack that makes it more understandable to humans. But the core functionality of just coming up with a logical completion is still 90+% of what it’s doing when you interact with it. ChatGPT does an excellent job of creating the illusion that it’s a personality, and obeying what you tell it to do as a counterpart, but that’s only because of excellent engineering on OpenAI’s part. A lot of the less well-refined models behave a lot more like just a language-completion machine and less like a conversational partner.

If you’re trying to get something done with an LLM (especially one that’s not made by OpenAI), it’s actually beneficial to think of it that way. E.g. instead of saying “Answer in the style of…”, just tee it up by showing a few previous lines of conversation among two parties where you’re illustrating what you want it to do, and let that interaction “soak into” the language model a little bit, and then when you ask it to complete the next statement, it’ll often do way better than if you’d described in all this detail what you wanted. Because that’s its core functionality. The whole thing where it’s a counterpart and having a conversation with you is sort of a hack that’s been fine-tuned on top of that, to make it easier and more impressive for people to interact with.