Paul Krill
Editor at Large

Apache moves on traffic server, machine learning projects

news
May 4, 20102 mins

For the first time, the open source organization is announcing six top-level efforts

The Apache Software Foundation, developer of open source software, on Tuesday is announcing the creation of six Top-Level Projects, including the Apache Traffic Server for caching and Apache Mahout, implementing machine-learning algorithms atop the Apache Hadoop distributed computing platform.

This is the first time Apache has announced six Top-Level Projects at the same time; Top-Level Project status signifies the highest level a project can reach at the organization.

[ See InfoWorld’s report on the recent hack of an Apache project server. ]

Traffic Server is a former commercial project from Yahoo, submitted as an Apache incubator project last year. Suitable for providing edge services in cloud computing, it can serve static content, such as images and JavaScript. Able to process more than 75,000 requests per second, Traffic Server also can route requests for dynamic content to a Web server.

“Becoming a Top-Level Project is a vote of confidence from the foundation at-large, demonstrating a project has proven its ability to be properly self-governed,” said ASF chairman Jim Jagielski in a statement released by the foundation.

Mahout, a former Apache sub-project, offers collaborative filtering, clustering, classification, and data mining algorithms.

Other former sub-projects moving to Top-Level status include:

  • Tika, which is an embeddable toolkit for content detection and analysis.
  • Nutch, a modular Web searching engine.
  • Avro, a fast data serialization system.
  • HBase, a distributed database modeled after Google’s Bigtable distributed storage system.

HBase and Avro are former subprojects of Hadoop, while Mahoot, Nutch, and Tika formerly were sub-projects of the Lucene search engine effort.

Other Top-Level Projects formed at Apache this year include UIMA (Unstructured Information Management Architecture), providing a framework for analyzing unstructured information; Cassandra, a second-generation “NoSQL” distributed data store; and Click, a Java EE Web application framework.

Apache this year also has accepted the Subversion versioning control system as a Top-Level Project this year, along with Shindig, a container for hosting OpenSocial applications.

This article, “Apache moves on traffic server, machine learning projects,” was originally published at InfoWorld.com. Follow the latest developments in business technology news and get a digest of the key stories each day in the InfoWorld Daily newsletter and on your mobile device at infoworldmobile.com.

Paul Krill

Paul Krill is editor at large at InfoWorld. Paul has been covering computer technology as a news and feature reporter for more than 35 years, including 30 years at InfoWorld. He has specialized in coverage of software development tools and technologies since the 1990s, and he continues to lead InfoWorld’s news coverage of software development platforms including Java and .NET and programming languages including JavaScript, TypeScript, PHP, Python, Ruby, Rust, and Go. Long trusted as a reporter who prioritizes accuracy, integrity, and the best interests of readers, Paul is sought out by technology companies and industry organizations who want to reach InfoWorld’s audience of software developers and other information technology professionals. Paul has won a “Best Technology News Coverage” award from IDG.

More from this author