Can I select a specific Region?

Q: Can I manually select the Region (Vietnam or Japan) for Serverless Inference?

A: Currently, the Serverless Inference package does not support manual selection of the processing Region.

To optimize cost and performance for general users, our system is designed to automatically distribute API requests flexibly between the Vietnam and Japan regions.

Please refer to the table below to understand the routing mechanism and choose the service that fits your needs:

Criteria

Serverless Inference

Dedicated Inference

Routing Mechanism

Cross-Region Auto-routing

Automatically switches regions (VN ↔ JP) during overloads to ensure service continuity.

Fixed Region

Fixed to the Region initially selected by the customer (e.g., Running only in VN).

Data Commitment

High Availability

Prioritizes availability. Data may be processed in another region during peak times.

Data Residency

Prioritizes data location. Guarantees data never leaves the selected Region.

Recommendation

For customers needing convenience and flexible costs, who accept short-term offshore data processing.

For Government and Banking sectors requiring strict compliance with Data Sovereignty laws.

If your project has specific requirements for a fixed processing region (e.g., strictly running in Vietnam for data residency compliance), please switch to the Dedicated Inference service.

Last updated