Serdar Yegulalp
Senior Writer

Hadoop project ODP regroups under Linux Foundation’s umbrella

news analysis
Mar 29, 20163 mins

The Open Data Platform's reorg aims to assuage criticism about vendor control over the initiative to create a consistent baseline Hadoop distribution

Elephant dog rain tint
Credit: Thinkstock

After major criticism within the Hadoop community regarding its nature and aims, Open Data Platform — an initiative to create a reference-standard Hadoop distributionannounced Monday it will now be hosted at the Linux Foundation as a Collaborative Project.

The goal with this new organizational structure is to ameliorate the perception that the ODP Initiative is vendor-owned and vendor-controlled, rather than just participated in by Hadoop vendors among many others.

Less regress, more progress

John Mertic, senior program manager for the ODPi, explained in a phone call that the main goal of the project is to bring together representatives from different parts of the Hadoop world — not only the distributions themselves, but the makers of tool sets, the ISVs, and the customer solution providers — and remove both the duplication of effort and the divisions that have arisen between the parties deploying Hadoop.

Most of the issues, Mertic explained, revolve around the way each distribution implements common parts of Hadoop in slightly differently. “Things as simple as filenaming strategies; environment variable exposure; not changing base, core, public APIs — each vendor had a slightly different approach,” Mertic said. They’ve proven to be a hindrance to customers, he said.

What’s to ensure that the ODPi doesn’t slip back into the hands of the vendors that work with it?  “For any decision in regards to what the focuses of a spec should be, what projects we look to include or remove, all of that is entirely driven by the vendors,” Mertic said. “If a member has a project they want to add, they pass it upstream to our release team.” The latter is composed of both technical and business members.

The release team sets priorities for the requested items and determines what a release for such features will look like. The results are then passed back to the vendors for a vote, where each member has a single ballot, regardless of size.

Pulling this process under the umbrella of the Linux Foundation provides “a way for vendors to engage and work together and collaborate in a vendor-neutral environment,” without resorting to as “committer proxy war” strategies at the project level.

Under a new umbrella

Placing the ODP under the aegis of the Linux Foundation, rather than the Apache Foundation, further distinguishes the initiative’s goals. “The Apache Foundation is build more around individual contributors and projects,” Mertic explained, implying that it’s mainly about development — “a place to play for people who are deep into these toolsets.”

People who want to address their specific user cases, such as analytics or SaaS vendors, want to find a way to do that without dealing with specific projects, Mertic said. The ODP’s work is thus intended to complement the Apache Foundation’s work rather than replace it.

“[The Apache Foundation] has paths for individual contributors, but it doesn’t really have paths for companies,” Mertic said. Most of that is accomplished by having employees from those companies contribute directly to projects. The ODPi, then, is meant to be “a different level of engagement for a different sort of audience,” Mertic said.

Serdar Yegulalp

Serdar Yegulalp is a senior writer at InfoWorld. A veteran technology journalist, Serdar has been writing about computers, operating systems, databases, programming, and other information technology topics for 30 years. Before joining InfoWorld in 2013, Serdar wrote for Windows Magazine, InformationWeek, Byte, and a slew of other publications. At InfoWorld, Serdar has covered software development, devops, containerization, machine learning, and artificial intelligence, winning several B2B journalism awards including a 2024 Neal Award and a 2025 Azbee Award for best instructional content and best how-to article, respectively. He currently focuses on software development tools and technologies and major programming languages including Python, Rust, Go, Zig, and Wasm. Tune into his weekly Dev with Serdar videos for programming tips and techniques and close looks at programming libraries and tools.

More from this author