Single-prompt HTML generation challenges testing an LLM's ability to create self-contained web applications, games, visualizations, and use JavaScript libraries.
Each model receives one prompt and must generate a complete, working HTML file. No multi-turn conversation, no tool use -- just raw code generation ability. Tests span four categories: