arche.rules.duplicates module

arche.rules.duplicates.check_items(df: pandas.core.frame.DataFrame, tagged_fields: Dict[str, List[str]]) → arche.rules.result.Result

Check for items that share the same name and URL.
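The idea behind this check can be illustrated with plain pandas. The following is a minimal sketch, not arche's actual implementation: the column names `name` and `url` and the sample rows are hypothetical, and the real function resolves the relevant columns from `tagged_fields` and returns a `Result` rather than a DataFrame.

```python
import pandas as pd

# Hypothetical sample data; the column names "name" and "url" are assumptions.
df = pd.DataFrame(
    {
        "name": ["Lamp", "Lamp", "Desk", "Chair"],
        "url": [
            "https://example.com/1",
            "https://example.com/1",
            "https://example.com/2",
            "https://example.com/3",
        ],
    }
)

# keep=False marks every member of a duplicate group,
# not only the second and later occurrences.
mask = df.duplicated(subset=["name", "url"], keep=False)
duplicated_items = df[mask]
```

Here `duplicated_items` holds the rows whose name and URL pair occurs more than once.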

arche.rules.duplicates.check_uniqueness(df: pandas.core.frame.DataFrame, tagged_fields: Dict[str, List[str]]) → arche.rules.result.Result

Verify that every item field tagged as unique holds only unique values.

Returns

A result containing the field names and keys of non-unique items.
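As a rough illustration of a per-field uniqueness check, the sketch below assumes that `tagged_fields` maps the tag name `"unique"` to a list of column names. The helper `non_unique_by_field` is hypothetical and returns a plain dict instead of an arche `Result`; it may differ from what the library does internally.

```python
from typing import Dict, List

import pandas as pd


def non_unique_by_field(
    df: pd.DataFrame, tagged_fields: Dict[str, List[str]]
) -> Dict[str, list]:
    """Hypothetical helper: map each field tagged "unique" to the
    index keys of rows whose value in that field is repeated."""
    report = {}
    for field in tagged_fields.get("unique", []):
        # Flag every row whose value appears more than once in this column.
        mask = df[field].duplicated(keep=False)
        if mask.any():
            report[field] = df.index[mask].tolist()
    return report


# Hypothetical sample data: "id" has a repeated value, "sku" does not.
df = pd.DataFrame({"id": [1, 2, 2, 3], "sku": ["a", "b", "c", "d"]})
report = non_unique_by_field(df, {"unique": ["id", "sku"]})
```

Fields with no repeated values are left out of the report, so an empty dict means every tagged field passed.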

arche.rules.duplicates.find_by(df: pandas.core.frame.DataFrame, columns: List[str]) → arche.rules.result.Result

Compare item rows in df by the given columns.

Returns

A result containing any duplicates found.
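Comparing rows by a chosen set of columns can be sketched with pandas as follows. This is an illustration under assumptions, not the library's code: the sample data and column names are hypothetical, and the grouping into a dict stands in for the `Result` object that the real function returns.

```python
import pandas as pd

# Hypothetical sample data; rows 0 and 2 agree on both compared columns.
df = pd.DataFrame(
    {
        "title": ["A", "B", "A", "C"],
        "price": [10, 20, 10, 30],
        "sku": ["x1", "x2", "x3", "x4"],
    }
)
columns = ["title", "price"]

# Keep only rows whose values in the compared columns occur more than once.
dupes = df[df.duplicated(subset=columns, keep=False)]

# Group the duplicated rows by their shared values and collect the index keys.
groups = {key: rows.index.tolist() for key, rows in dupes.groupby(columns)}
```

Columns outside `columns`, such as `sku` here, are ignored during the comparison, so rows can count as duplicates even when their other fields differ.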