mex.common.primary_source package¶
Submodules¶
mex.common.primary_source.extract module¶
mex.common.primary_source.helpers module¶
mex.common.primary_source.models module¶
mex.common.primary_source.transform module¶
Module contents¶
Helper extractor to get metadata primary sources.
It represents the original source of metadata that all the data in MEx will attach to. For example confluence-vvt primary source means: data extracted from confluence-vvt x-system will be attached to this primary source.
Common use cases¶
extract info of a particular primary source to attach extracted metadata to
Configuration¶
To configure primary_source extractor, set primary_sources_path in settings to point to primary-sources.json in mex-assets repository. A sample primary sources file is also included in mex-extractors at assets/raw-data/primary-sources/primary-sources.json for testing purposes.
Extracting primary sources¶
Use extract_seed_primary_sources in primary_source.extract function to extract all primary sources. This function will yield all the primary sources available in primary_sources.json source file.
Transforming primary sources¶
Use transform_seed_primary_sources_to_extracted_primary_sources in primary_sources.transform to get ExtractedPrimarySource. This function will yield all the primary sources, which is often not required.
So to filter out only the required x-system primary sources use get_primary_sources_by_name in primary_sources.transform. This function needs Iterable from step-1 and names of the required x-systems. For example by passing names as mex, ldap, confluence-vvt will return ExtractedPrimarySource of these x-systems.