- REFERENCE9 min read
So you want Database Versioning?
Here at DoltHub, we've had a lot of success with our "So you want..." series of blog posts helping people find Dolt when they are looking for it. Dolt is a lot of things. Dolt is a version controlled database, a Git database, Git for data, data…
Read More
- PYTHON5 min read
Doltpy 1.0.0
Background Dolt is a SQL database that stores data in a commit graph, and offers a Git-like interface for management. It offers a command-line-interface (CLI) that provides managing database level considerations such as how and where to sta...
Read More - 5 min read
Learn SQL with Real Data using Dolt
Dolt is a SQL database with Git-style versioning. DoltHub is a place on the internet to share Dolt databases. We think these tools can help people learn and perfect their SQL skills like no other database. This blog explains how. Get sta...
Read More - DATASET2 min read
May Dataset Spotlight
Every month we highlight some interesting datasets on DoltHub. The focus is on new or updated datasets but sometimes we shed fresh light on a classic. For those new to Dolt and DoltHub , Dolt is Git for data. Git versions files. Dolt versio...
Read More - REFERENCEWEB4 min read
Delivering Declarative Data to DoltHub with GraphQL
DoltHub is GitHub for data. As you might imagine, the data-fetching needs on the front end of such an application are intense. In the previous article in this series , we saw how working directly with our gRPC API was making our front-end...
Read More - 7 min read
Extending SQL Sync to Postgres
Background Dolt is Git for data. It is a relational database that implements a storage layout similar to a commit graph, allowing users to clone, branch, and merge structured data. We believe the ability to clone and pull a remote dataset, an...
Read More - FEATURE RELEASESQL5 min read
Introducing Secondary Indexes
Dolt is a SQL database with Git-style versioning. We're constantly adding new and exciting SQL features, and secondary indexes are one of them! This blog goes over what they are, why they're useful, and how they're implemented in Dolt . W...
Read More - USE CASE6 min read
Dolt as an Application Server
A question we have been asked numerous times is, "Can Dolt be used as an application server"? This has driven a lot of conversations internally about the use cases of a versioned database server, and led to some very technical discussions about…
Read More - 5 min read
Distribute Data with Dolt, not APIs
Application Programming Interfaces (APIs) are the dominant mode of distributing data on the internet . Twitter debates in the data science community about Comma Separated Value (CSV) files vs APIs have flared up lately. We think both of these ...
Read More - REFERENCEWEB8 min read
How GraphQL Saved Us from the gRPC Dumpster Fire We Created
DoltHub is the online data community powered by Dolt , the version-controlled SQL database. In the previous article in this series, we took an overview of DoltHub's front-end architecture. In this article, we'll take a look at the pit of sad...
Read More - REFERENCE5 min read
The Dolt Commit Graph and Structural Sharing
Dolt is a SQL database that provides Git-like functionality, including clone, push, pull, branch, and merge. This post is part of a series exploring how Dolt stores table data. In our previous post, How Dolt Stores Table Data , we explored a un...
Read More - 10 min read
Using Dolt to Manage Train/Test Splits
Twitter is wonderful sometimes. We don't know Aaron. He finds us on Twitter, asks a great question, makes us think, and prompts a blog post. How can you use Dolt to manage train/test splits for your Machine Learning m...
Read More - INTEGRATION6 min read
Using Dolt with the JetBrains DataGrip SQL Workbench
Dolt has been rapidly expanding its capabilities as a SQL server recently. We've done a lot of work to get the dolt sql-server command to be a stable peer to the built-in SQL shell, with all the same capabilities. In the last month we've e...
Read More - SQL6 min read
Joining Multiple Repositories with SQL Queries
In our blogs we have shown over and over again how easy it is to clone data from DoltHub and immediately start querying it with SQL. We are constantly working on improving our data catalog. As we do, there emerge more occasions where you can der...
Read More - 2 min read
Adopting go-mysql-server
go-mysql-server is the SQL query execution engine that powers Dolt and DoltHub . Today we are excited to announce that we are adopting the project after its founding company ceased operations. Our fork of the project has over 400 ad...
Read More - DATASET3 min read
April Dataset Spotlight
This blog entry is the first in a new series. Every month we will highlight some interesting datasets on DoltHub. The focus will be on new or updated datasets but sometimes we'll shed fresh light on a classic. For those new to Dolt and Dolt...
Read More - 10 min read
Dolt and DoltHub: Publish Using CSVs
Dolt is a SQL database with Git-style versioning. DoltHub is a place to share Dolt repositories. Dolt is Git for data. DoltHub is GitHub for Dolt. We want to host your public data on DoltHub. We think Dolt and DoltHub provide the best sharing ...
Read More - 4 min read
Introducing Dolt to SQL sync
Background While building Dolt and DoltHub, we have had many conversations with our users. They all share an interest in finding better ways to manage data. They recognize that writing code to massage CSV, JSON, and other less well known form...
Read More - 5 min read
Using Dolt to Find Test Regressions
Dolt is Git for data. It's a database that lets you clone, fork, branch, merge and diff. This is a really cool technology that has a lot of uses , but today we're going to focus on just one: using Dolt SQL to find regressions in test resul...
Read More - DATASET3 min read
Common Vulnerabilities and Exposures in Dolt
TLDR: The NVD is a lot more useful when you can simply clone it and query it. The National Vulnerability Database (NVD) is the authoritative source for the publication of Common Vulnerabilities and Exposures (CVE). The vulnerabilities cata...
Read More - DATASET12 min read
28 grams of Cannabis Data Sets
Happy 4/20! Today is April 20th, the unofficial holiday of marijuana afficionados the world over. Happy 4/20! Or, as we in the data business like to say, Happy 20%! Recreational marijuana has been legal...
Read More