1. REFERENCE
    9 min read

    So you want Database Versioning?

    Tim Sehn

    Here at DoltHub, we've had a lot of success with our "So you want..." series of blog posts helping people find Dolt when they are looking for it. Dolt is a lot of things. Dolt is a version controlled database, a Git database, Git for data, data…

Found 20 matching articles.
  1. WEB
    5 min read

    Managing DoltHub Dependencies

    Dolt is Git for data and DoltHub is our web application that houses Dolt repositories. DoltHub consists of three separate React applications: our main Next.js app, as well as two Gatsby apps for our blog and documentation. Our depe...

  2. 3 min read

    Performance Benchmarks on Pull Request

    Overview: Not long ago we wrote about measuring Dolt's performance against MySQL with the goal of improving Dolt to be no more than 2-4 times slower than MySQL. To work toward this goal, we created a containerized tool that benchma...

  3. BOUNTY
    2 min read

    Hospital Price Transparency $10,000 Database Bounty

    On January 1, 2021, a US law was passed requiring hospitals to publish their prices in human- and machine-readable format. We would like to assemble the best open dataset of hospital prices in the US to aid researchers. To this end, we’re launchi...

  4. WEB
    7 min read

    Supporting Larger File Imports on DoltHub

    Introduction: Back in November, we announced support for uploading CSV files on DoltHub directly to Dolt repository commits. Since then, we've been quickly iterating on features for upload on the web. We recently released changes to our ...

  5. GOLANG
    10 min read

    Optimizing varint Decoding

    Introduction: Dolt stores data in a content-addressable prolly tree in order to get efficient merges and diffs. In designing the table data format, one of our goals was to make table column additions and deletions fast operations. They sho...

  6. DATASET
    21 min read

    Pennsylvania ballot data revisited

    Introduction: In November, shortly after the election, we published an analysis of Pennsylvania ballot data provided by the Pennsylvania Department of State. The purpose of the analysis was to determine if there was any truth to claims of…

  7. DATASET
    2 min read

    December Dataset Spotlight

    We have been running the DoltHub dataset spotlight since May 2020. This is our eighth issue. The intent was to add additional exposure to Dolt datasets published on DoltHub. Publishing this blog monthly has presented some challenges conten...

  8. SQL
    15 min read

    Planning joins to make use of indexes

    Introduction: Dolt is Git for Data. It's a SQL database that you can clone, fork, branch, and merge. Dolt's SQL engine is go-mysql-server, and today we're going to discuss how it implements join planning to make a query plan involving mul...

  9. BOUNTY
    2 min read

    US Presidential Election $25,000 Database Bounty Update

    Last Monday, we released our first data bounty to earn a share of $25,000 by wrangling US Presidential Precinct-level data. This blog will update you on the progress and encourage you to participate. Finally, we'll get a little meta and let y...

  10. FEATURE RELEASE, SQL
    5 min read

    Keyless Tables in Dolt

    Dolt is a tool built for collaboration and data distribution; it's Git for Data. Git versions files, Dolt versions tables. Today, we're announcing support for keyless tables in Dolt. Strongly typed schemas are the best and worst parts of re...

  11. BOUNTY
    4 min read

    Bounty Attribution

    On Monday we launched Bounties, a product that pays users to gather and clean data. In less than a week, our first data bounty has already shown the power of Dolt as a collaborative data platform. In that time our bounty has received 22 Pu...

  12. BOUNTY
    4 min read

    Introducing Data Bounties

    In 2018, we started the company that is now DoltHub to "create a place on the internet to get access to interesting, maintained data". The data ecosystem of today reminds us a lot of the open source ecosystem of the late 1990s and early 2000s. It's ...

  13. BOUNTY
    6 min read

    Earn your share of $25,000 building US Presidential Election Database

    Today, we're launching a way to make money building Dolt databases called Bounties. We'll have a follow-on blog post Wednesday explaining the motivations for the Bounties feature. But today, we're going to jump right to the chase and explain ...

  14. DATASET
    5 min read

    Archiving Presidential Tweets Using Dolt

    Background: This is a guest blog post by a member of the DoltHub community, detailing how they went about accumulating presidential tweets in Dolt. We are grateful to our community members for showing us ways of using Dolt we didn’t think of, ...

  15. FEATURE RELEASE
    5 min read

    Introducing Dolt CLI in SQL

    Dolt is Git for data, a SQL database with version control tooling. While Dolt is nearing full MySQL compatibility, its current command line interface (CLI) functionality hasn't been accessible in SQL. That means that you can't currently run…

  16. 3 min read

    Getting a Mascot for Dolt

    Dolt is Git for Data. It's a SQL database that you can branch, merge, clone, fork, push and pull, just like files in Git. Today we're going to be talking about our quest to get Dolt's branding right, and our first attempt to find a mascot that…

  17. 5 min read

    Database Performance: Dolt vs MySQL

    Dolt is a version-controlled SQL database. Dolt's query interface is SQL, and it has Git-like version control features. Adding version control features to a SQL database has performance trade-offs when comparing Dolt with traditional databases li...

  18. DATASET
    2 min read

    November Dataset Spotlight

    Every month we highlight some interesting datasets on DoltHub. The focus is on new or updated datasets but sometimes we shed fresh light on a classic. For those new to Dolt and DoltHub, Dolt is Git for data. Git versions files. Dolt versio...

  19. FEATURE RELEASE
    6 min read

    Filter-Branch in Dolt

    Dolt is a tool built for collaboration and data distribution, a SQL database you can branch, merge, diff, clone, fork, push and pull. Today, we're announcing support for filter-branch in Dolt. "Customer focus" is a mantra for our company. In A...

  20. 12 min read

    Continuous Deployment with Github Actions: An Example

    Github Actions FTW: Not too long ago we endeavored to migrate Dolt's continuous integration pipeline from Jenkins to Github Actions. I wrote a blog about that process and complimented Github Actions on making the migration process int...
