Apache Drill 1.0 tears into data, with or without Hadoop

news analysis

May 19, 2015

Drill 1.0 queries Hadoop data via SQL, but may have a life of its own outside of the framework

How many ways can you mine Hadoop data with plain old SQL queries? Lots, but Apache Drill, one of the most versatile of the bunch, has hit its 1.0 milestone, and it’s set to work with more than Hadoop alone.

Drill is an open source implementation of Google’s Dremel/BigQuery engine. It was designed to query multiple kinds of data, including unstructured JSON, structured CSVs, the Apache Parquet format for columnar storage, schemas in Apache Hive’s Metastore, and more conventional structured data sources.
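To make that range of sources concrete, here is a minimal sketch of how the same SQL shape might address each of them. It assumes Drill's convention of naming a storage plugin (such as `dfs` for files or `hive` for the Metastore) followed by a backtick-quoted file path or table name. The paths, the `orders` table, and the helper names `table_ref` and `preview` are hypothetical and only build query strings; nothing here contacts a Drill cluster.

```python
def table_ref(plugin: str, name: str) -> str:
    """Build a Drill-style table reference such as dfs.`/data/logs.json`.

    Assumption: the plugin names used below (dfs, hive) are configured
    on the target Drill installation.
    """
    if "`" in name:
        raise ValueError("backticks are not allowed in a table or file name")
    return f"{plugin}.`{name}`"


def preview(plugin: str, name: str, limit: int = 10) -> str:
    """Return a SELECT statement that previews the first rows of a source."""
    return f"SELECT * FROM {table_ref(plugin, name)} LIMIT {int(limit)}"


# Hypothetical sources, one per kind of data mentioned in the article.
EXAMPLES = {
    "json": preview("dfs", "/data/logs.json"),
    "csv": preview("dfs", "/data/sales.csv"),
    "parquet": preview("dfs", "/data/events.parquet"),
    "hive": preview("hive", "orders"),
}

if __name__ == "__main__":
    for kind, sql in EXAMPLES.items():
        print(f"{kind:8} {sql}")
```

The point of the sketch is that only the table reference changes from one source to the next; the query itself stays ordinary SQL.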

Aside from needing nothing more than ANSI SQL to run queries and using conventional ODBC/JDBC connectors to allow access to the data, Drill doesn’t require schemas to be defined for the data before querying. This means less involvement from IT to prepare data for analysis; anyone with a suitable tool set and the proper permissions can plug in and begin querying.
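As a rough illustration of querying without declaring a schema first, the sketch below submits a SQL statement to a Drill instance over HTTP. It uses Drill's REST interface rather than the ODBC/JDBC connectors described above, purely to keep the example dependency-free. The address `http://localhost:8047/query.json`, the `rows` field in the response, and the bundled `cp.employee.json` sample file are assumptions about a default local installation; check them against your own Drill version before relying on them.

```python
import json
import urllib.request

# Assumption: a local Drill instance exposing its REST endpoint here.
DRILL_URL = "http://localhost:8047/query.json"


def build_request(sql: str, url: str = DRILL_URL) -> urllib.request.Request:
    """Package a SQL statement as a JSON POST request for Drill."""
    body = json.dumps({"queryType": "SQL", "query": sql}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_query(sql: str, url: str = DRILL_URL) -> list:
    """Send the query and return the result rows (empty list if absent)."""
    with urllib.request.urlopen(build_request(sql, url)) as response:
        return json.load(response).get("rows", [])


if __name__ == "__main__":
    # No CREATE TABLE or schema definition precedes this query; the
    # structure of the JSON file is discovered when it is read.
    for row in run_query("SELECT * FROM cp.`employee.json` LIMIT 3"):
        print(row)
```

Running the script requires a reachable Drill instance; `build_request` can be inspected on its own without one.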

Jack Norris, chief marketing officer for MapR (maker of a Google Capital-funded Hadoop distribution that offers Drill as a supported component), said the 1.0 designation was earned by bringing the project to feature completeness for production environments. The core feature set for Drill has been fixed for some time.

But another key aspect of Drill is its future apart from Hadoop, which is only one of many different kinds of data sources that Drill can query. Many of MapR’s business intelligence partners, said Norris, are contributing their perspective and support to Drill. “At the very least, Drill expands their flexibility,” he said, “and their ability to access some of these new data formats directly, so that it’s bringing self-service [data access] to these systems as well.”

As questions arise about how much enterprise uptake can be realistically expected of Hadoop, it’s prudent for projects conventionally thought of as Hadoop-centric efforts to have second lives. That includes not only Drill, but other Apache data-crunching projects like Spark. If Hadoop’s flame turns out to be bright but short-lived, Drill won’t be lacking for other areas to tear into.

Data Management

by Serdar Yegulalp

Senior Writer


Serdar Yegulalp is a senior writer at InfoWorld. A veteran technology journalist, Serdar has been writing about computers, operating systems, databases, programming, and other information technology topics for 30 years. Before joining InfoWorld in 2013, Serdar wrote for Windows Magazine, InformationWeek, Byte, and a slew of other publications. At InfoWorld, Serdar has covered software development, devops, containerization, machine learning, and artificial intelligence, winning several B2B journalism awards including a 2024 Neal Award and a 2025 Azbee Award for best instructional content and best how-to article, respectively. He currently focuses on software development tools and technologies and major programming languages including Python, Rust, Go, Zig, and Wasm. Tune into his weekly Dev with Serdar videos for programming tips and techniques and close looks at programming libraries and tools.
