phenotrex.transforms package

Submodules

phenotrex.transforms.annotation module

phenotrex.transforms.resampling module

class phenotrex.transforms.resampling.TrainingRecordResampler(random_state: Optional[float] = None, verb: bool = False)[source]

Bases: object

Instantiates an object which can generate versions of a TrainingRecord resampled to defined completeness and contamination levels. Requires prior fitting with full List[TrainingRecord] to get sources of contamination for both classes.

Parameters
  • random_state – Randomness seed to use while resampling

  • verb – Toggle verbosity

fit(records: List[phenotrex.structure.records.TrainingRecord])[source]

Fit TrainingRecordResampler on full TrainingRecord list to determine set of positive and negative features for contamination resampling.

Parameters

records – the full List[TrainingRecord] on which ml training will commence.

Returns

True if fitting was performed, else False.

get_resampled(record: phenotrex.structure.records.TrainingRecord, comple: float = 1.0, conta: float = 0.0) phenotrex.structure.records.TrainingRecord[source]

Resample a TrainingRecord to defined completeness and contamination levels. Comple=1, Conta=1 will double set size.

Parameters
  • comple – completeness of returned TrainingRecord features. Range: 0 - 1

  • conta – contamination of returned TrainingRecord features. Range: 0 - 1

  • record – the input TrainingRecord

Returns

a resampled TrainingRecord.

Module contents

phenotrex.transforms.fastas_to_grs(*args, **kwargs)