Paul Krill
Editor at Large

Web Codegen Scorer evaluates AI-generated web code

news
Sep 22, 20252 mins

Google’s Angular team has open-sourced a tool that evaluates the quality of web code generated by LLMs. It works with any web library or framework.

Credit: Shutterstock/TippaPatt

Google’s Angular team has unveiled Web Codegen Scorer, a tool for evaluating the quality of web code generated by LLMs (large language models).

Introduced September 16, Web Codegen Scorer focuses on web code generation and comprehensive quality evaluation, Simona Cotin, senior engineering manager for Angular, wrote in a blog post. Cotin noted that the tool helped the Angular team create the fine-tuned prompts, available at angular.dev/ai, that optimize LLMs for the framework. The tool also helps the team to better integrate application features and syntax as the framework evolves, she said.

Web Codegen Scorer can be used to make evidence-based decisions pertaining to AI-generated code. Developers, for example, could iterate on a system prompt to find the most-effective instructions for a project, compare quality of code produced by different models, and monitor generated code quality as models and agents evolve. Web Codegen Scorer is different from other code benchmarks in that it focuses on web code and relies primarily on well-established measures of code quality, Cotin said.

Web Codegen Scorer can be used with any web library or framework, or none at all, as well as with any model. Instructions on installing Web Codegen Scorer can be found on GitHub.

Specific capabilities include:

  • Configuring evaluations with different models, frameworks, and tools.
  • Specifying system instructions and adding MCP (Model Context Protocol) servers.
  • Built-in checks for build success, runtime errors, accessibility, security, LLM rating, and coding best practices.
  • Automatic attempts to repair issues detected during code generation.
  • Viewing and comparing results with a report viewer UI.
Paul Krill

Paul Krill is editor at large at InfoWorld. Paul has been covering computer technology as a news and feature reporter for more than 35 years, including 30 years at InfoWorld. He has specialized in coverage of software development tools and technologies since the 1990s, and he continues to lead InfoWorld’s news coverage of software development platforms including Java and .NET and programming languages including JavaScript, TypeScript, PHP, Python, Ruby, Rust, and Go. Long trusted as a reporter who prioritizes accuracy, integrity, and the best interests of readers, Paul is sought out by technology companies and industry organizations who want to reach InfoWorld’s audience of software developers and other information technology professionals. Paul has won a “Best Technology News Coverage” award from IDG.

More from this author