✱ Stainless preview builds
This PR will update the
✅ togetherai-python studio · code
✅ togetherai-go studio · code
  labels: ['good', 'bad'],
  pass_labels: ['good'],
- model_to_evaluate: 'meta-llama/Llama-3.1-8B-Instruct-Turbo',
+ model_to_evaluate: 'Qwen/Qwen3.5-9B',
We should change the judge model in all of these from model: 'meta-llama/Llama-3.1-70B-Instruct-Turbo' to model: 'openai/gpt-oss-120b'.
  labels: ['good', 'bad'],
  pass_labels: ['good'],
- model_to_evaluate: 'meta-llama/Llama-3.1-8B-Instruct-Turbo',
+ model_to_evaluate: 'Qwen/Qwen3.5-9B',
We should change the judge model in all of these from model: 'meta-llama/Llama-3.1-70B-Instruct-Turbo' to model: 'openai/gpt-oss-120b'.
  labels=["good", "bad"],
  pass_labels=["good"],
- model_to_evaluate="meta-llama/Llama-3.1-8B-Instruct-Turbo"
+ model_to_evaluate="Qwen/Qwen3.5-9B"
We should change the judge model in all of these from model: 'meta-llama/Llama-3.1-70B-Instruct-Turbo' to model: 'openai/gpt-oss-120b'.
  labels=["good", "bad"],
  pass_labels=["good"],
- model_to_evaluate="meta-llama/Llama-3.1-8B-Instruct-Turbo"
+ model_to_evaluate="Qwen/Qwen3.5-9B"
We should change the judge model in all of these from model: 'meta-llama/Llama-3.1-70B-Instruct-Turbo' to model: 'openai/gpt-oss-120b'.
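Since the same swap is requested on every snippet, it could be applied mechanically. Below is a rough sketch of one way to do that, not something taken from this PR: the helper name `swap_judge_model` is hypothetical, and it assumes the judge model ID never appears on the same line as `model_to_evaluate`.

```python
# Hypothetical helper (not part of this PR): swap the judge model ID in a
# documentation snippet while leaving model_to_evaluate lines untouched.
OLD_JUDGE = "meta-llama/Llama-3.1-70B-Instruct-Turbo"
NEW_JUDGE = "openai/gpt-oss-120b"


def swap_judge_model(snippet: str) -> str:
    """Return the snippet with the old judge model ID replaced by the new one."""
    updated_lines = []
    for line in snippet.splitlines(keepends=True):
        # Assumption: the evaluated model sits on its own line, so skipping
        # those lines keeps model_to_evaluate exactly as the PR set it.
        if "model_to_evaluate" not in line:
            line = line.replace(OLD_JUDGE, NEW_JUDGE)
        updated_lines.append(line)
    return "".join(updated_lines)


if __name__ == "__main__":
    example = (
        "model: 'meta-llama/Llama-3.1-70B-Instruct-Turbo',\n"
        "model_to_evaluate: 'Qwen/Qwen3.5-9B',\n"
    )
    print(swap_judge_model(example), end="")
```

The quote style is left alone, so the same helper should work on both the TypeScript-style (`model: '...'`) and Python-style (`model="..."`) snippets.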
No description provided.