Add support for preference tuning pipelines #376

Open
Tracked by #335 ...
ktam3 opened this issue Nov 13, 2024 · 2 comments

ktam3 commented Nov 13, 2024

After things are reconciled, we need to add support for preference tuning pipelines in SDG.

Introduce 3 pipelines:

  • Data annotation routing: route samples for preference tuning and annotate them
  • Student model response generation
  • Log-likelihood generation for the weak vs. strong model, used for reward calculation
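
To make the third pipeline concrete, here is a minimal sketch of how log-likelihoods from a strong and a weak model could be turned into a reward and then into a preference pair. Everything in it is an assumption for illustration and not an agreed design: the `ScoredResponse` structure, the choice of a length-normalized log-likelihood difference as the reward, and the `prompt`/`chosen`/`rejected` output keys are all hypothetical. Per-token log-probabilities are taken as given; producing them from actual models is out of scope here.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ScoredResponse:
    """One response with per-token log-probs under two models (hypothetical structure)."""
    text: str
    strong_token_logprobs: List[float]
    weak_token_logprobs: List[float]


def sequence_log_likelihood(token_logprobs: List[float]) -> float:
    # The log-likelihood of a sequence is the sum of its per-token log-probabilities.
    return sum(token_logprobs)


def reward(resp: ScoredResponse) -> float:
    # Assumed reward: strong-model log-likelihood minus weak-model log-likelihood,
    # divided by the token count so that longer responses are not favored.
    n = len(resp.strong_token_logprobs)
    if n == 0 or n != len(resp.weak_token_logprobs):
        raise ValueError("both models must score the same non-empty token sequence")
    strong_ll = sequence_log_likelihood(resp.strong_token_logprobs)
    weak_ll = sequence_log_likelihood(resp.weak_token_logprobs)
    return (strong_ll - weak_ll) / n


def to_preference_pair(prompt: str, a: ScoredResponse, b: ScoredResponse) -> Dict[str, str]:
    # The response with the higher reward becomes "chosen"; ties keep `a` as chosen.
    chosen, rejected = (a, b) if reward(a) >= reward(b) else (b, a)
    return {"prompt": prompt, "chosen": chosen.text, "rejected": rejected.text}


if __name__ == "__main__":
    resp_a = ScoredResponse("A", [-0.5, -0.5], [-1.0, -1.0])
    resp_b = ScoredResponse("B", [-1.0, -1.0], [-0.5, -0.5])
    print(reward(resp_a), reward(resp_b))
    print(to_preference_pair("example prompt", resp_a, resp_b))
```

The length normalization is one possible design choice; a raw difference or a ratio could be substituted without changing the surrounding structure.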

ktam3 changed the title from "In SDG - After things are reconciled, we need to add support for preference tuning pipelines - Introduce 3 pipelines: - Data annotation routing > route samples for preference tuning and annotate - Student Model response generation - Log likelihood generation for weak vs strong model for reward calculation" to "Add support for preference tuning pipelines" on Nov 13, 2024

ktam3 transferred this issue from instructlab/training on Nov 13, 2024

ktam3 commented Nov 13, 2024

@Maxusmusti - I can't add you as an assignee here.

aakankshaduggal (Member) commented

@ktam3 I will work with the training team (@Maxusmusti) on this.
