Cutting-edge NLP techniques

Direct Preference Optimisation

Decision Transformers

Fine-tuning via reinforcement learning with human feedback

Active