arche.arche module

class arche.arche.Arche(source: str, schema: Union[str, Dict[str, Dict[str, Union[str, bool, int, float, None, List[T]]]], None] = None, target: Optional[str] = None, start: int = 0, count: Optional[int] = None, filters: Optional[List[Tuple[str, str, str]]] = None, expand: bool = True)

Bases: object

basic_json_schema(items_numbers: List[int] = None)

Prints a json schema based on data from self.source

Parameters

items_numbers – array of item numbers to create a schema from

check_metadata
compare_metadata
compare_with_customized_rules(source_items, target_items, tagged_fields)
data_quality_report(bucket: Optional[str] = None)
static get_items(source: str, start: int, count: Optional[int], filters: Optional[List[Tuple[str, str, str]]], expand: bool) → Union[arche.readers.items.JobItems, arche.readers.items.CollectionItems]
glance()

Run JSON schema check and output results. In most cases it will stop after the first error per item. Usable for big jobs as it’s about 100x faster than validate_with_json_schema().

report_all()
run_all_rules()
run_comparison_rules
run_customized_rules(items, tagged_fields)
run_general_rules
run_schema_rules()
save_result(rule_result)
schema
source_items
target_items
validate_with_json_schema()

Run JSON schema check and output results. It will try to find all errors, but there are no guarantees. Slower than check_with_json_schema()