Data Management | News, how-tos, features, reviews, and videos
.NET for Apache Spark 1.0 provides high-performance .NET APIs for Apache Spark, including Spark SQL, Spark Streaming, and MLlib.
Pairing your on-prem SQL Server with a cloud-based instance for high availability has its challenges, but they can be overcome. Here’s how.
Among the many ways of implementing a GraphQL engine, only one approach offers the same performance, scalability, and ACID guarantees as the underlying database.
MongoDB Atlas database-as-a-service now allows distributed MongoDB databases to span the Amazon, Google, and Microsoft clouds
Microsoft scaled down its flagship database, squeezing it into 500MB and running it on edge hardware.
Kubernetes runs distributed applications, and Apache Cassandra provides a distributed database environment. Here’s how you can run them together.
Microsoft and Databricks say the vectorized query engine written in C++ accelerates Apache Spark workloads by up to 20x
Completely community-driven, with no centralized ownership, Postgres has been the elephant in the room for more than 30 years
A new MariaDB storage engine provides distributed SQL and massive scalability with a shared-nothing architecture, fully distributed ACID transactions, and strong consistency.
A federated SQL query execution engine created at Facebook, Presto brings interactive querying to all of your data, no matter where it resides.