The input consists of four tables. All four tables must be present, although the nodes_obsolete.csv table may be empty (containing only a header row).
These tables must adhere to the specified schema.
All input tables are profiled; the results are presented in the Data Profiling section. The pipeline also validates all inputs and halts if any errors are found. Validation details are presented in the Data Tests section. For more details about the input data requirements, please read the docs.
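
As an illustration of the presence and emptiness rules described above, the following is a minimal sketch and not the pipeline's actual validation code. The function name `check_input_tables` and its parameters are hypothetical. Only nodes_obsolete.csv is named in this section, so the caller supplies the other file names. The sketch also assumes that every table other than nodes_obsolete.csv needs at least one data row, which the text implies but does not state outright.

```python
import csv
from pathlib import Path


def check_input_tables(input_dir, required, may_be_empty=("nodes_obsolete.csv",)):
    """Return a list of error messages; an empty list means the checks passed.

    Hypothetical helper: `required` is the list of expected file names,
    `may_be_empty` names the tables allowed to contain only a header row.
    """
    errors = []
    for name in required:
        path = Path(input_dir) / name
        if not path.is_file():
            errors.append(f"{name}: file is missing")
            continue
        with path.open(newline="", encoding="utf-8") as handle:
            reader = csv.reader(handle)
            header = next(reader, None)               # first row, if any
            has_rows = next(reader, None) is not None  # at least one data row
        if not header:
            errors.append(f"{name}: header row is missing")
        elif not has_rows and name not in may_be_empty:
            # Assumption: only the tables in `may_be_empty` may lack data rows.
            errors.append(f"{name}: table has no data rows")
    return errors
```

A caller could halt on a non-empty result, for example by raising an exception that lists the collected messages. Schema checks (column names and types) are deliberately left out, since the schema itself is not reproduced in this section.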