Select Base Model

Retrieve base models from the Model Hub in two ways:

  • Model Catalog: A repository of model sources from various providers such as DeepSeek, Gemma, Llama, and Qwen.

  • Private Model: A repository for user-owned models and fine-tuned models

The Model Catalog includes the following models:

Base model
Model family
Model type
Model size
Learning stage

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

DeepSeek

LLM

70B

Instruction-tuned

deepseek-ai/DeepSeek-R1-Distill-Llama-8B

DeepSeek

LLM

8B

Instruction-tuned

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

DeepSeek

LLM

1.5B

Instruction-tuned

deepseek-ai/DeepSeek-R1-Distill-Qwen-14B

DeepSeek

LLM

14B

Instruction-tuned

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

DeepSeek

LLM

32B

Instruction-tuned

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

DeepSeek

LLM

7B

Instruction-tuned

google/gemma-3-12b-it

Gemma

LLM

2B

Instruction-tuned

google/gemma-3-12b-pt

Gemma

LLM

2B

Pre-trained

google/gemma-3-1b-it

Gemma

LLM

1B

Instruction-tuned

google/gemma-3-1b-pt

Gemma

LLM

1B

Pre-trained

google/gemma-3-27b-it

Gemma

LLM

27B

Instruction-tuned

google/gemma-3-27b-pt

Gemma

LLM

27B

Pre-trained

google/gemma-3-4b-it

Gemma

LLM

4B

Instruction-tuned

google/medgemma-27b-text-it

Gemma

LLM (Medical)

27B

Instruction-tuned

meta-llama/Llama-3.1-70B

Llama

LLM

70B

Pre-trained

meta-llama/Llama-3.1-70B-Instruct

Llama

LLM

70B

Instruction-tuned

meta-llama/Llama-3.1-8B

Llama

LLM

8B

Pre-trained

meta-llama/Llama-3.1-8B-Instruct

Llama

LLM

8B

Instruction-tuned

meta-llama/Llama-3.2-1B

Llama

LLM

1B

Pre-trained

meta-llama/Llama-3.2-1B-Instruct

Llama

LLM

1B

Instruction-tuned

meta-llama/Llama-3.2-3B

Llama

LLM

3B

Pre-trained

meta-llama/Llama-3.2-3B-Instruct

Llama

LLM

3B

Instruction-tuned

meta-llama/Llama-3.3-70B-Instruct

Llama

LLM

70B

Instruction-tuned

mistralai/Mixtral-8x7B-Instruct-v0.1

Mistral

MoE LLM

8x7B

Instruction-tuned

mistralai/Mixtral-8x7B-v0.1

Mistral

MoE LLM

8x7B

Pre-trained

Qwen/Qwen2-0.5B

Qwen

LLM

0.5B

Pre-trained

Qwen/Qwen2-0.5B-Instruct

Qwen

LLM

0.5B

Instruction-tuned

Qwen/Qwen2-1.5B

Qwen

LLM

1.5B

Pre-trained

Qwen/Qwen2-1.5B-Instruct

Qwen

LLM

1.5B

Instruction-tuned

Qwen/Qwen2-72B

Qwen

LLM

72B

Pre-trained

Qwen/Qwen2-72B-Instruct

Qwen

LLM

72B

Instruction-tuned

Qwen/Qwen2-7B

Qwen

LLM

7B

Pre-trained

Qwen/Qwen2-7B-Instruct

Qwen

LLM

7B

Instruction-tuned

Qwen/Qwen2-VL-2B

Qwen

VLM

2B

Pre-trained

Qwen/Qwen2-VL-2B-Instruct

Qwen

VLM

2B

Instruction-tuned

Qwen/Qwen2-VL-72B

Qwen

VLM

72B

Pre-trained

Qwen/Qwen2-VL-72B-Instruct

Qwen

VLM

72B

Instruction-tuned

Qwen/Qwen2-VL-7B

Qwen

VLM

7B

Pre-trained

Qwen/Qwen2-VL-7B-Instruct

Qwen

VLM

7B

Instruction-tuned

Qwen/Qwen2.5-0.5B

Qwen

LLM

0.5B

Pre-trained

Qwen/Qwen2.5-0.5B-Instruct

Qwen

LLM

0.5B

Instruction-tuned

Qwen/Qwen2.5-1.5B

Qwen

LLM

1.5B

Pre-trained

Qwen/Qwen2.5-1.5B-Instruct

Qwen

LLM

1.5B

Instruction-tuned

Qwen/Qwen2.5-14B

Qwen

LLM

14B

Pre-trained

Qwen/Qwen2.5-14B-Instruct

Qwen

LLM

14B

Instruction-tuned

Qwen/Qwen2.5-32B

Qwen

LLM

32B

Pre-trained

Qwen/Qwen2.5-32B-Instruct

Qwen

LLM

32B

Instruction-tuned

Qwen/Qwen2.5-3B

Qwen

LLM

3B

Pre-trained

Qwen/Qwen2.5-3B-Instruct

Qwen

LLM

3B

Instruction-tuned

Qwen/Qwen2.5-72B

Qwen

LLM

72B

Pre-trained

Qwen/Qwen2.5-72B-Instruct

Qwen

LLM

72B

Instruction-tuned

Qwen/Qwen2.5-7B

Qwen

LLM

7B

Pre-trained

Qwen/Qwen2.5-7B-Instruct

Qwen

LLM

7B

Instruction-tuned

Qwen/Qwen2.5-VL-32B-Instruct

Qwen

VLM

32B

Instruction-tuned

Qwen/Qwen2.5-VL-3B-Instruct

Qwen

VLM

3B

Instruction-tuned

Qwen/Qwen2.5-VL-72B-Instruct

Qwen

VLM

72B

Instruction-tuned

Qwen/Qwen2.5-VL-7B-Instruct

Qwen

VLM

7B

Instruction-tuned

Qwen/Qwen3-0.6B

Qwen

LLM

0.6B

Pre-trained

Qwen/Qwen3-1.7B

Qwen

LLM

1.7B

Pre-trained

Qwen/Qwen3-14B

Qwen

LLM

14B

Pre-trained

Qwen/Qwen3-30B-A3B

Qwen

LLM

30B

Pre-trained

Qwen/Qwen3-32B

Qwen

LLM

32B

Pre-trained

Qwen/Qwen3-4B

Qwen

LLM

4B

Pre-trained

Qwen/Qwen3-8B

Qwen

LLM

8B

Pre-trained

The Private Model, if you want to upload your models, please contact us or follow the guide upload model through SDK

Last updated