Data build tool

dbt
Developer: dbt Labs
Initial release: December 3, 2021
Stable release: 1.10.11 / September 4, 2025 [1]
Written in: Python
Operating system: Microsoft Windows, macOS, Linux
Type: Data analytics, data management
License: Apache License 2.0
Website: docs.getdbt.com

    Data build tool (dbt) is an open-source command-line tool that helps analysts and engineers transform data in their warehouse more effectively.[2]

    History

    dbt started at RJMetrics in 2016 as a way to add basic transformation capabilities to Stitch (acquired by Talend in 2018).[3] The earliest versions of dbt allowed analysts to contribute to the data transformation process while following software engineering best practices.[4]

    From the beginning, dbt was open source.[5] In 2018, the dbt Labs team (then called Fishtown Analytics) released a commercial product on top of dbt Core.[6]

    Funding

    In April 2020, dbt Labs announced its Series A, led by Andreessen Horowitz.[7] In November 2020, it announced its Series B, led by Andreessen Horowitz and Sequoia.[8] In June 2021, dbt Labs raised its Series C, led by Altimeter, Sequoia, and Andreessen Horowitz.[9] In February 2022, the company raised $222 million in its Series D at a $4.2 billion valuation.[10]

    Overview

    dbt enables analytics engineers to transform data in their warehouses by writing SELECT statements, which dbt turns into tables and views. dbt handles the transformation (T) step of extract, load, transform (ELT) processes: it does not extract or load data, but is designed to perform well at transforming data that is already inside a warehouse. dbt aims to let analysts work more like software engineers, in line with the dbt viewpoint.[11]
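
The idea of turning a SELECT statement into a table or view can be illustrated with a minimal Python sketch. This is not dbt's implementation: it uses an in-memory SQLite database as a stand-in for a warehouse, and the `materialize` helper, the `raw_orders` table, and the model names are hypothetical names chosen for illustration.

```python
import sqlite3


def materialize(conn, name, select_sql, kind="view"):
    """Wrap a SELECT statement in DDL so it becomes a table or a view.

    Hypothetical helper for illustration only; dbt's own logic differs.
    """
    if kind not in ("view", "table"):
        raise ValueError(f"unsupported materialization: {kind}")
    conn.execute(f"CREATE {kind.upper()} {name} AS {select_sql}")


# SQLite stands in for a data warehouse that already holds loaded data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE raw_orders (id INTEGER, amount REAL, status TEXT)")
conn.executemany(
    "INSERT INTO raw_orders VALUES (?, ?, ?)",
    [(1, 10.0, "paid"), (2, 5.0, "refunded"), (3, 7.5, "paid")],
)

# Each "model" is just a SELECT; the helper decides how it is persisted.
materialize(
    conn,
    "paid_orders",
    "SELECT id, amount FROM raw_orders WHERE status = 'paid'",
    kind="view",
)
materialize(
    conn,
    "revenue",
    "SELECT SUM(amount) AS total FROM paid_orders",
    kind="table",
)

total = conn.execute("SELECT total FROM revenue").fetchone()[0]
kinds = dict(
    conn.execute(
        "SELECT name, type FROM sqlite_master "
        "WHERE name IN ('paid_orders', 'revenue')"
    )
)
```

In this sketch the second model selects from the first, so the transformation is built entirely from data already in the database; nothing is extracted or loaded.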

    dbt uses YAML files to declare properties. A seed is a type of reference table used in dbt for static or infrequently changed data (for example, country codes or lookup tables). Seeds are CSV-based and typically stored in a seeds folder.
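
As a rough illustration of what a seed amounts to, the following Python sketch loads a small CSV into a database table that later queries can join against. It is a sketch under stated assumptions, not how dbt loads seeds: SQLite stands in for the warehouse, every column is stored as text, and the `load_seed` helper and the `country_codes` data are hypothetical.

```python
import csv
import io
import sqlite3


def load_seed(conn, table_name, csv_text):
    """Create a table from CSV text, using the header row as column names.

    Hypothetical helper for illustration; all columns are stored as TEXT.
    Returns the number of data rows loaded.
    """
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    columns = ", ".join(f'"{col}" TEXT' for col in header)
    conn.execute(f'CREATE TABLE "{table_name}" ({columns})')
    placeholders = ", ".join("?" for _ in header)
    rows = list(reader)
    conn.executemany(
        f'INSERT INTO "{table_name}" VALUES ({placeholders})', rows
    )
    return len(rows)


# Stand-in for a CSV file kept in a seeds folder (contents are made up).
COUNTRY_CODES_CSV = "code,name\nDE,Germany\nFR,France\nJP,Japan\n"

seed_conn = sqlite3.connect(":memory:")
loaded = load_seed(seed_conn, "country_codes", COUNTRY_CODES_CSV)

# The seed now behaves like any other lookup table.
country = seed_conn.execute(
    "SELECT name FROM country_codes WHERE code = ?", ("FR",)
).fetchone()[0]
```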
