In this tutorial, we implement an end-to-end Direct Preference Optimization workflow to align a large language model with human preferences without using a reward model. We combine TRL’s DPOTrainer ...
This repository implements an end-to-end forced alignment pipeline using the Montreal Forced Aligner (MFA). The goal is to automatically align speech audio with its corresponding transcription at word ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results