|
|
8 months ago | |
|---|---|---|
| .. | ||
| accelerate_configs | 8 months ago | |
| .env.example | 8 months ago | |
| 00_quick_test.py | 8 months ago | |
| 01_dataset_loading.py | 8 months ago | |
| 02_reward_functions.py | 8 months ago | |
| 03_lora_configuration.py | 8 months ago | |
| 04_sft_training.py | 8 months ago | |
| 05_grpo_training.py | 8 months ago | |
| 06_complete_pipeline.py | 8 months ago | |
| 07_model_evaluation.py | 8 months ago | |
| 08_distributed_training.py | 8 months ago | |
| config.json | 8 months ago | |