LLM Evals Beyond Accuracy: Harms, Bias, and Cost
Language models such as ChatGPT keep getting more capable. But when people talk about them, they often focus on just one thing: accuracy. Does the model give the right answer? Sure, that’s important. But it’s far from the whole story. There’s …
