1. REFERENCE
    9 min read

    So you want Database Versioning?

    Tim Sehn|

    Here at DoltHub, we've had a lot of success with our "So you want..." series of blog posts helping people find Dolt when they are looking for it. Dolt is a lot of things. Dolt is a version controlled database, a Git database, Git for data, data…

    Read More
Found 20 matching articles.
  1. PYTHON
    2 min read

    Mypy and Doltpy

    Dolt Dolt is an SQL-database with Git-versioning. The goal of Doltpy , in concert with Dolt , is to solve reproducibility and versioning problems for data and machine learning engineers using Python. Mypy Mypy was created by Guid...

    Read More
  2. SQL
    3 min read

    Dolt CLI in SQL - Update

    Dolt is a SQL database with Git-style versioning. In a previous post we discussed the need to introduce Dolt CLI functions in SQL. We believe that version control is something that can be native to your SQL workflow. This allows for possibilit...

    Read More
  3. BOUNTY
    2 min read

    US Presidential Election $25,000 Database Bounty Review

    On December 14, we launched our first data bounty to earn a share of $25,000 by wrangling US Presidential Precinct-level data . The bounty ended yesterday. How did it go? This blog entry will answer that question. Dolt is a SQL database w...

    Read More
  4. PYTHON
    7 min read

    Doltpy 2.0

    Background Earlier in the week we talked about Dolt's "API surface area." To recap, Dolt is a relational database with version control features. Dolt has a SQL query interface implementing the MySQL dialect, as well as a command line inter...

    Read More
  5. FEATURE RELEASESQL
    6 min read

    Introducing Type Changes

    Dolt is a SQL database with Git-style versioning. With each new version of Dolt , we increase the number of supported SQL features, moving toward our goal of being a complete drop-in replacement for MySQL, while adding all of the versioning fe...

    Read More
  6. 8 min read

    Part I: Dolt API Surface Area

    Background When DoltHub was founded it was called Liquidata. The goal was to bring liquidity to the data market. The founders realized that the pipes were broken: sending around CSV, JSON, and other formats was broken. The requirement to tran...

    Read More
  7. WEB
    11 min read

    A Guide to Unit Testing React Apollo Components

    DoltHub is a place on the internet to share, discover, and collaborate on Dolt databases. It's a Next.js application written in Typescript, backed by a GraphQL server that calls gRPC services written in Golang. We use Apollo 's buil...

    Read More
  8. 5 min read

    Dolt vs MySQL: How it Started, How it's Going

    How it Started For those following along, we've been working on improving Dolt's performance with the goal of making Dolt no more than 2-4 times slower than MySQL . When we set out to measure Dolt's performance we chose Sysbench , a wi...

    Read More
  9. DATASET
    1 min read

    January Dataset Spotlight

    It's that time. Our January dataset spotlight here at DoltHub . For the new folks, Dolt is a SQL database with git-like versioning and DoltHub is a place on the internet to share Dolt databases. This monthly feature keeps you updated on Data...

    Read More
  10. FEATURE RELEASEWEB
    2 min read

    Announcing DoltHub Issues

    DoltHub is a place on the internet to share and collaborate on Dolt databases. We built DoltHub because we thought it would be useful to interact with versioned SQL databases in familiar ways. For example, query public data on the web, or c...

    Read More
  11. 2 min read

    More Hiring

    In October, we set out to hire more engineers to work on Dolt and DoltHub . Dolt is a SQL database with Git-like versioning and DoltHub is a place to share Dolt repositories. Since then, we added three engineers: Vinai , Remy , and Ma...

    Read More
  12. 5 min read

    Release notes generation for GitHub repos

    Introduction Today we're excited to announce the open sourcing of a tool to automatically generate markdown formatted release notes for GitHub repositories . Dolt is using this tool to generate our release notes going forward, and we've ...

    Read More
  13. USE CASE
    5 min read

    Dolt and Data Science - A Simple Example

    Dolt is Git for data, a SQL database with version control. We've been working hard recently on making Dolt a useful tool for Data Science (DS) practitioners and we're hoping to launch some slick integrations soon. But first, we wanted to start...

    Read More
  14. WEB
    5 min read

    Managing DoltHub Dependencies

    Dolt is Git for data and DoltHub is our web application that houses Dolt repositories. DoltHub consists of three separate React applications: our main Next.js app, as well as two Gatsby apps for our blog and documentation. Our depe...

    Read More
  15. 3 min read

    Performance Benchmarks on Pull Request

    Overview Not long ago we wrote about measuring Dolt's performance against MySQL with the goal of improving Dolt to be no more than 2-4 times slower than MySQL . To work toward this goal, we created a containerized tool that benchma...

    Read More
  16. BOUNTY
    2 min read

    Hospital Price Transparency $10,000 Database Bounty

    On January 1, 2021, a US law was passed requiring hospitals to publish their prices in human and machine readable format. We would like to assemble the best open dataset of hospital prices in the US to aid researchers. To this end, we’re launchi...

    Read More
  17. WEB
    7 min read

    Supporting Larger File Imports on DoltHub

    Introduction Back in November, we announced support for uploading CSV files on DoltHub directly to Dolt repository commits. Since then, we've been quickly iterating on features for upload on the web. We recently released changes to our ...

    Read More
  18. GOLANG
    10 min read

    Optimizing varint Decoding

    Introduction Dolt stores data in a content addressable prolly tree in order to get efficient merges and diffs. In designing the table data format one of our goals was to make table column additions and deletions fast operations. They sho...

    Read More
  19. DATASET
    21 min read

    Pennsylvania ballot data revisited

    Introduction In November, shortly after the election, we published an analysis of Pennsylvania ballot data provided by the Pennsylvania Department of State. The purpose of the analysis was to determine if there was any truth to claims of…

    Read More
  20. DATASET
    2 min read

    December Dataset Spotlight

    We have been running the DoltHub dataset spotlight since May 2020. This is our eighth issue. The intent was to add additional exposure to Dolt datasets published on DoltHub. Publishing this blog monthly has presented some challenges conten...

    Read More
JOIN THE DATA EVOLUTION

Get started with Dolt

Or join our mailing list to get product updates.