Ctrlk

✌️How to Evaluate a Model?

Run History

Run history provides a detailed log of all testing runs for a selected model.

Notice: Each record in history shows when the job was started

Run Details

The Run details page provides a comprehensive overview of a fine-tuning job. It includes metadata, configuration settings, and metrics.

You can see:

Input: The input data of the test, for example, a question in a Question Answering task.
Ground Truth: The correct answer (label) corresponding to each input, used for comparison with the predicted result.
Output: The answer that the model produces based on the input.
Metrics: Calculated based on the comparison between output and ground truth.

PreviousHow to Manage a Job?NextUser Token

Last updated 1 month ago