config.yml
Using the PipeRider config.yml file
config.yml
is the main PipeRider project configuration file, and contains data sources and related profiling settings.
An example config.yml
file for a Postgres project:
The config.yml
file is created when a new project is initialized, and stores the following information for your PipeRider project.
Data source settings
The name of your data source
The type of data source e.g. sqlite, postgres etc.
(sqlite) The path to the database file
(dbt) the dbt project information (profile, target, project path)
Telemetry ID for anonymized tracking
If the data source requires credentials they will be stored separately in credentials.yml
.
For SQLIte projects, it is also possible to store the database path in credentials.yml
if desired.
Profiler settings
The following settings enable you to configure the behavior of the profiler. Uncomment and adjust the settings as required.
Profile a sample of rows
Set the maximum number of rows to profile per datasource table. In the following example, a maximum number of 10,000 rows will be profiled:
Profile duplicate rows
Enabling it to let the profiler find the duplicate rows from the table. It is disabled by default due to it could be a time-consuming process according to datasets.
Include/exclude tables from profiling
By default, PipeRider will profile all existing tables. To specifically include or exclude tables from being profiled, add or remove tables from the includes
and excludes
arrays.
PipeRider will profile tables specified in includes
and ignore tables specified in excludes
.
An empty array means no tables are specified. To profile all tables, leave these options commented.
Profile views
By default, profiling views is not enabled. To allow PipeRider to profile views, uncomment the following line:
Table settings
Add table descriptions
Add table and column descriptions which will be shown on your PipeRider report.
Telemetry
This is the anonymous project id that was created during project initialization.
Last updated