✌️How to Evaluate Model?

Playground

Playground is space where users can interact with AI models in a chat-like format. It’s designed for testing messages, evaluating model responses, and adjusting model behavior.

You can follow guide to evaluate your model:

1

Adjust parameters

Parameters include:

Name
Description
Type
Supported value

Temperature

Controls randomness in assistant responses. Lower = more focused, higher = more creative.

Float

[0.00, 2.00] (commonly 1.00 is balanced)

Add stop sequence

Defines where the model should stop generating text.

String

Custom string(s)

Output length

Limits the number of tokens in the response.

Int

[0, 8192]

Top-P

Controls diversity via nucleus sampling. Lower = more focused

Float

(0.00, 1.00]

We recommend you adjust parameters based on each purpose:

Purpose
Temperature
Add stop sequence
Output length
Top-P

Creative writing

1.0–1.5

500+

0.8–1.0

Technical explanation

0.2–0.5

200–500

0.3–0.6

Summarization

0.2–0.5

150–300

0.3–0.6

Code generation

0.1–0.4

“n” or “#”

0.3–0.6

Conversational agent

0.7–1.0

200–600

0.8–1.0

2

Start chatting

The Playground interface is structured around Completions, which include:

Prompts
Description

System messages

Define the assistant’s behavior and tone. It helps guide how the model responds throughout the conversation. Eg: You are a creative writing assistant. Always write with vivid imagery, emotional depth, and a storytelling tone.

User messages

Represent the input or query from you. You can type text to test LLM models or add to upload an image to test VLM models.

Assistant responses

The AI model’s reply is based on user messages and system messages.

You can interact with the model in real time, observe how it responds, and iterate based on your goals.

3

Refine your prompts

To improve or explore different behaviors:

  • Modify the system message to change the assistant’s personality or tone.

  • Adjust the user message to test different types of queries.

  • Tweak the parameters to observe how the model’s output changes.

4

Clear completion

  1. Click icon Clear to reset and start a new completion.

  2. After clearing completion, completion is finished and stored in completion history.

Completion History

Completion history allows you to view and manage the past completions. It helps track previous messages, responses, and session details for review or reuse.

Accessing completion history

  1. Navigate to the Completion History tab in your interface. This page displays a list of all your current and past completions.

  2. You can browse through the list or use the search bar to find specific completions using:

  • Keywords from the last assistant responses.

Viewing completion details

  1. Click on one completion to expand it.

  2. You will see:

  • User message

  • Assistant response

  • Duration of the completion

  • Timestamp of when it occurred

Continuing a completion

  1. To resume a past completion, click the Continue button.

  2. This will reopen Playground, allowing you to continue chatting with your model.

Delete a completion

To remove a completion from your history:

  1. Click the trash icon or the Delete button.

  2. Confirmation popup show to confirm the deletion

Notice: Deleted completions are permanently removed and cannot be recovered.

Download all completions

To export your entire completion history:

  1. Click the Download button

  2. The supported file format is '.xlsx'. It consists of all user messages, assistant responses, timestamps, and metadata for each completion.

  3. (Optional) Download completion you want to select by clicking the start icon and navigate My pick

Last updated