Spark and big memory have the potential to run big data workloads without copying gobs of data to a new storage infrastructure
Hadoop jobs can get complicated. The open source ETL tool Kettle beats the alternatives in providing the orchestration you need
Already the hottest thing in big data, Spark 1.6 turns up the heat. Here are the high points, including improved streaming and memory management
Amazingly, Hadoop has been redefined in the space of a year. Let's take a look at all the salient parts of this roiling ecosystem and what they mean
Yes, it's the poignant sequel to <a href="/article/3013229/application-development/10-things-you-dont-need-to-worry-about-in-2016.html">last week's reprieve</a>: a jolly list of worries to keep ....
Are you ready for 2016? Of course you aren't -- you don't even want to think about it. If you need excuses for your lack of forethought, read on
Here's all you need to know about Angular 2, the exciting new successor to Google's wildly popular JavaScript framework, AngularJS
Spark may have taken big data by storm, but IBM's promise to train a million people on Spark goes too far, mainly because the technology is evolving too fast
Spark has dethroned MapReduce and changed big data forever, but that rapid ascent has been accompanied by persistent frustrations
Joyent started the container party, later validated by Docker. Despite superior technology, does an independent public cloud like Joyent have a chance?
Old models of computing always tend to linger too long, but client-server was based on a fallacy -- and needs to go away sooner rather than later
A survey from Dell says that growth companies are turning to big data in droves
From containers to NoSQL to Spark, here are the IT trends you can expect to persist next year
The once red-hot database technology is losing its luster, as NoSQL reaches mass adoption
Empower users to acquire and manipulate the data they really need, and BI can become an order of magnitude more useful -- and much less work for IT
It's official: The Hadoop ecosystem has received a brain transplant. Here's how Cloudera, the leading Hadoop vendor, laid out the implications of swapping MapReduce for Spark
The components of the Hadoop ecosystem won't overthrow Teradata or IBM Netezza any time soon, but ultimately, the commodity solution almost always wins
A new poll of customers provides a brighter, more detailed picture of Hadoop adoption than Gartner's famously downbeat survey
The marijuana business is making the leap from paper ledgers to the cloud and big data analytics, offering a provocative example for us all
We all have our reasons for quitting a job, some of them emotional. Follow these guidelines and you'll avoid making moves you'll later regret
Data governance is one of the toughest, dreariest problems in computing. Sadly, the tools offered with the major Hadoop distributions aren't really up to the task
The Hadoop ecosystem has always been a bag of parts, each of which needed to be secured separately -- at least until Apache Ranger came to town
Think you're breaking new ground with your Hadoop project? Odds are it fits neatly into one of these seven common types of projects
Do enough Hadoop and NoSQL deployments, and the same problems crop up again and again. It's time for the industry to nail them sooner rather than later
Analytics drive decisions, but some decisions shouldn't wait until batch processes complete -- which is why, eventually, we'll all analyze data as it streams in
These four truths will help you determine which Hadoop technology to use for the types of workloads you anticipate
Fresh from the front lines: Common problems encountered when putting Hadoop to work -- and the best tools to make Hadoop less burdensome
The NoSQL trend has given us a crazy array of new database choices. Clusterpoint has just jumped in to offer a cloud-based document database
These unfounded beliefs about budget, skills, technology, and technology fit can lead you astray
Sure, a NoSQL or JSON data warehouse sounds faddish, but SonarW is a better solution for many
MongoDB World 2015 introduces interesting new features in MongoDB 3.2 and more interesting questions about the future of the company and its community
With recent enhancements, you can now get a truly useful education from Coursera and its ilk -- in data science, machine learning, and many other subjects
Oh no! Big data is failing because we can't find enough people who know the technology! Relax, they're out there -- but don't fall for the buzzwords
Database virtualization, a seemingly bad idea from the past, turns out to be a good idea in the present
A flexible replacement for Hadoop MapReduce that supports real-time and batch processing, Flink offers advantages over Spark
Dear corporations: We've already ceded our privacy. Now implement the technology to serve us right
Outside of programmatic trading and fraud detection, we're only starting to use machine learning in business. The biggest inhibitor may be our lack of imagination in applying it
Soon, we'll see 'prepacked' applications that incorporate the distributed processing, machine learning, and analytics of today's overhyped, custom-made solutions
Spark is the hottest project in big data -- but Databricks, the company behind it, needs to ensure its implementation has a plausible path to maturity
A $99 device and a Spark back end create an ecosystem of car-connected data and applications
PaaS is finally getting traction, and when it takes hold, the lives of developers will change forever
Ten months ago, we published a cheat sheet for learning about Hadoop, the center of the big data vortex. Check out what's been added since then
When combined with scanners, today's 3D printers do more than mold figurines out of plastic goop. They're closer to displacing a broad swath of everyday jobs than you think
The open data movement purports to cultivate an informed citizenry and rescue government offices from the dark ages -- but can it tell me which streets to avoid on my bike?
Now that Pivotal has partnered with Hortonworks and open-sourced Big Data Suite, the future of the business looks to be in Cloud Foundry's court
Data isn't the real world; it only describes it. A new company wants to let you drill past the numbers to lifelike, real-time representations of what's going on
The new MongoDB features document-level locking, better write performance, big memory support, and more. At last, MongoDB is all grown up
The Internet of things seems futuristic, but real systems are delivering real analytics value today. Here's some real-world IoT advice from the field
We live in an age of uncertainty, where old assumptions suddenly become open questions. Generalized anxiety is bad for you, though, so focus on these 10 points
We all have enough to worry about. Read this list and you'll have 10 fewer reasons to stay up all night