Credit to @bontchev
Ideally you’d want the layers to not be restricted to LLMs, but rather to include different frameworks that do a better job of incorporating rules or providing an objective output. LLMs are fantastic for generation because they are based on probabilities, but they really cannot provide any amount of objectivity for the same reason.
It’s already been done, for at least a year. ChatGPT plugins are the “different frameworks”, and running a set of LLMs self-reflecting on a train of thought, is AutoGPT.
It’s like:
- Can I stick my fingers in a socket? - Yes.
- What would be the consequences? - Bad.
- Do I want these consequences? - Probably not
- Should I stick my fingers in a socket? - No
However… people like to cheap out, take shortcuts and run an LLM with a single prompt and a single iteration… which leaves you with “Yes” as an answer, then shit happens.