Artificial Intelligence | News, analysis, features, how-tos, and videos
Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Open source has always had issues, but the benefits outweighed the costs/risks. AI is not merely exponentially accelerating tasks, it is disproportionately increasing risks.
A technical preview promises to take on the unrewarding work in DevOps, but questions remain about controls over costs and access.
Serverless integration and GPU efficiency become central as Mistral expands beyond models into enterprise AI infrastructure.
Noname API security optimized for greater performance and lower business costs with 3rd Gen Intel® Xeon® Scalable processors.
Latest update to Anthropic’s popular AI model also promises improvements for computer use, long-context reasoning, agent planning, knowledge work, and design.
W3C proposal backed by Google and Microsoft allows developers to expose client-side JavaScript tools to AI agents, enabling collaborative workflows between users and agents within the same web interface.
The new model claims benchmark improvements and agent capabilities as competition among Chinese AI vendors accelerates.
AI agent OSS activity opens the door to future supply chain attacks, says security company.
Peter Steinberger will lead personal agent development, while the viral open-source project will continue under an open-source foundation.