Skip to main content

Overview

CloudBase AI provides out-of-the-box text generation capabilities, supporting mainstream large models including DeepSeek, Hunyuan, MiniMax, Kimi, GLM, and more. It covers the following core features:

FeatureDescription
Multiple Access MethodsMini Program SDK, wx-server-sdk, Web SDK, Node SDK, cURL, OpenAI SDK, Anthropic SDK
Multi-turn ConversationCarry conversation history for continuous dialogue
Streaming OutputReturn tokens incrementally for better user experience
Deep ThinkingInvoke reasoning-capable models to obtain thinking process
Context CachingCache long prompt prefixes to reduce repeated invocation costs
Context ManagementAutomatically truncate or compress long conversation history
Tool CallingLet models call custom functions/tools

Access Methods

CloudBase provides three ways to access large models:

MethodDescriptionApplicable Plan/Scenario
Resource-Point PlanUse plan resource points to deduct Token usage; supports DeepSeek, MiniMax, Hunyuan, Kimi, GLM, and other modelsResource-Point Plan
Trial ModelUse the built-in Hunyuan model for freeMini Program Growth Plan
Custom ModelConfigure any model compatible with OpenAI / Anthropic protocolStandard plan and above

Resource-Point Plan

info

The Token Resource Pack was discontinued on June 18, 2026. If your current plan is not a Resource-Point Plan, please go to CloudBase Console → Plan Management to switch to a Resource-Point Plan first.

Resource points included in the Resource-Point Plan can be used to deduct large model Token usage, supporting the latest models such as DeepSeek, MiniMax, Hunyuan, Kimi, GLM, and more. See the supported model list.

Go to Activate Resource-Point Plan

Trial Model

The trial model is exclusively for the Mini Program Growth Plan. Currently only Hunyuan is supported. To use DeepSeek or other models, please switch to a Resource-Point Plan.

The following Hunyuan models are currently available:

ModelDescription
hy3-previewHunyuan instruction model

Custom Model

CloudBase Standard plan and above supports configuring third-party models. Once configured, you can call them directly via the CloudBase JS SDK and Mini Program SDK.

Supported Protocols

Custom models support large model interfaces compatible with the following protocols. See CloudBase LLM API Protocol Guide for details:

  • Chat Completions API (OpenAI compatible, most universal)
  • Responses API
  • Anthropic Messages API

Required Parameters

ParameterDescription
BaseURLThe API request URL of the large model
APIKeyThe key for accessing the large model API
Model NameThe specific model identifier, e.g., deepseek-v4-flash, deepseek-v3, etc.

Configuration Steps

  1. Obtain BaseURL and API Key

    The following provides integration references for Hunyuan and DeepSeek. For other models, refer to the corresponding provider's documentation:

    ModelProviderBaseURLModel NameBilling
    hunyuanTencent Hunyuanhttps://api.hunyuan.cloud.tencent.com/v1See Hunyuan Model ListSee Billing Docs
    DeepSeekDeepSeek Officialhttps://api.deepseek.com/v1See DeepSeek Model ListSee DeepSeek Docs
  2. Configure in the platform

    Go to the CloudBase AI+ console. There are two ways to configure:

    • Supported platforms: Select the corresponding platform and fill in the API Key directly

      Select platform to configure API Key

    • Other models: Click the "Add Model Provider" button and fill in the BaseURL, API Key, and model name

      Add model provider

Frequently Asked Questions

How do I determine if a large model is compatible with the supported protocols?

CloudBase supports three protocols: Chat Completions, Responses API, and Anthropic Messages API. Most mainstream large models (DeepSeek, GLM, Kimi, Qwen, etc.) are compatible with the Chat Completions protocol, and typically state "OpenAI compatible" explicitly in their API documentation.

What should I do if the large model doesn't work properly after configuration?

Please check the following:

  • Is the API Key correct and not expired?
  • Is the BaseURL accurate?
  • Does the model name match the one in the provider's documentation?
  • Have you enabled billing or topped up your account on the third-party platform?

How much concurrency does the large model support?

  • When using the trial model, one environment supports 5 concurrent requests.
  • After configuring a custom model, the platform no longer limits concurrency. The actual concurrency limit depends on the third-party model provider's restrictions.