mex.extractors.sinks package¶
Submodules¶
mex.extractors.sinks.base module¶
- mex.extractors.sinks.base.load(models: Iterable[ExtractedAccessPlatform | ExtractedActivity | ExtractedBibliographicResource | ExtractedConsent | ExtractedContactPoint | ExtractedDistribution | ExtractedOrganization | ExtractedOrganizationalUnit | ExtractedPerson | ExtractedPrimarySource | ExtractedResource | ExtractedVariable | ExtractedVariableGroup | MergedAccessPlatform | MergedActivity | MergedBibliographicResource | MergedConsent | MergedContactPoint | MergedDistribution | MergedOrganization | MergedOrganizationalUnit | MergedPerson | MergedPrimarySource | MergedResource | MergedVariable | MergedVariableGroup | AccessPlatformRuleSetResponse | ActivityRuleSetResponse | BibliographicResourceRuleSetResponse | ConsentRuleSetResponse | ContactPointRuleSetResponse | DistributionRuleSetResponse | OrganizationRuleSetResponse | OrganizationalUnitRuleSetResponse | PersonRuleSetResponse | PrimarySourceRuleSetResponse | ResourceRuleSetResponse | VariableRuleSetResponse | VariableGroupRuleSetResponse]) None ¶
Load models to all of the configured sinks.
mex.extractors.sinks.s3 module¶
- class mex.extractors.sinks.s3.S3Sink¶
Bases:
BaseSink
Sink to load models as new-line delimited JSON file into S3 bucket.
- CHUNK_SIZE = 1000¶
- SERVICE_NAME = 's3'¶
- SIGNATURE_VERSION = 's3'¶
- __init__() None ¶
Instantiate a new S3 sink.
- close() None ¶
Close the underlying boto client.
- load(items: Iterable[_LoadItemT]) Generator[_LoadItemT, None, None] ¶
Write the incoming items as an NDJSON directly to S3.
- Parameters:
items – Iterable of any kind of items
- Returns:
Generator for the loaded items
Module contents¶
- mex.extractors.sinks.load(models: Iterable[ExtractedAccessPlatform | ExtractedActivity | ExtractedBibliographicResource | ExtractedConsent | ExtractedContactPoint | ExtractedDistribution | ExtractedOrganization | ExtractedOrganizationalUnit | ExtractedPerson | ExtractedPrimarySource | ExtractedResource | ExtractedVariable | ExtractedVariableGroup | MergedAccessPlatform | MergedActivity | MergedBibliographicResource | MergedConsent | MergedContactPoint | MergedDistribution | MergedOrganization | MergedOrganizationalUnit | MergedPerson | MergedPrimarySource | MergedResource | MergedVariable | MergedVariableGroup | AccessPlatformRuleSetResponse | ActivityRuleSetResponse | BibliographicResourceRuleSetResponse | ConsentRuleSetResponse | ContactPointRuleSetResponse | DistributionRuleSetResponse | OrganizationRuleSetResponse | OrganizationalUnitRuleSetResponse | PersonRuleSetResponse | PrimarySourceRuleSetResponse | ResourceRuleSetResponse | VariableRuleSetResponse | VariableGroupRuleSetResponse]) None ¶
Load models to all of the configured sinks.