Show HN: A new benchmark for testing LLMs for deterministic outputs

by khurdula | View on Hacker News