Your LLM Doesn't Write Correct Code. It Writes Plausible Code

One of the simplest tests you can run on a database:

Read in full here: