lmflow.pipeline.dpo_aligner#
Classes#
A subclass of BasePipeline which is alignable. |
Functions#
|
Load dataset and convert it to the necessary format. |
Module Contents#
- lmflow.pipeline.dpo_aligner.get_paired_dataset(data_root: str, data_dir: str, sanity_check: bool = False, cache_dir: str | None = None, num_proc=24) datasets.Dataset [source]#
Load dataset and convert it to the necessary format.
The dataset is converted to a dictionary with the following structure: {
‘prompt’: List[str], ‘chosen’: List[str], ‘rejected’: List[str],
}
- Prompts are structured as follows:
“Question: “ + <prompt> + “
Answer: “
- class lmflow.pipeline.dpo_aligner.DPOAligner(model_args, data_args, aligner_args)[source]#
Bases:
lmflow.pipeline.base_aligner.BaseAligner
A subclass of BasePipeline which is alignable.