Intelligent Search

Intelligent Search (MetaInsight) is an AI-powered capability based on Tencent Cloud Infinite that provides intelligent content analysis and retrieval for multimedia files such as images, videos, and documents in cloud storage, helping you quickly locate and manage massive file resources.

Feature Introduction

Intelligent Search combines scalar retrieval (traditional metadata-based retrieval) with vector retrieval (AI-powered semantic retrieval) to provide multi-modal hybrid search.

Core Capabilities

Retrieval Type	Description	Application Scenarios
Basic Information Retrieval	Metadata-based retrieval such as filename, size, type, creation time, etc.	File management, data statistics
Image Retrieval	Image-to-image and text-to-image search based on image content semantics	Photo albums, product search and recommendation
Face Retrieval	Recognize facial features in images and search photos containing the same person	Security monitoring, attendance systems, smart albums

Retrieval Methods Comparison

Feature	Scalar Retrieval	Vector Retrieval
Principle	Exact matching based on file attributes (metadata)	Similarity matching based on AI semantic understanding
Typical Use	Query by filename, type, time	Image content search, document semantic retrieval
Advantages	Fast query speed, precise results	Intelligent understanding, supports fuzzy matching
Use Cases	Queries with known file attributes	Content relevance queries
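To make the comparison above concrete, here is a minimal sketch that contrasts the two approaches on an in-memory catalog. It is not how Intelligent Search is implemented: the file list, the 3-dimensional vectors, and the function names are all made up for illustration, and cosine similarity is used only as one common way to compare embeddings.

```python
import math

# Toy catalog: scalar metadata plus a made-up 3-dimensional embedding per file.
FILES = [
    {"name": "beach.jpg", "content_type": "image/jpeg", "vec": [0.9, 0.1, 0.0]},
    {"name": "coast.png", "content_type": "image/png", "vec": [0.8, 0.2, 0.1]},
    {"name": "report.pdf", "content_type": "application/pdf", "vec": [0.0, 0.1, 0.9]},
]


def scalar_search(files, content_type):
    """Scalar retrieval: exact match on a metadata attribute."""
    return [f["name"] for f in files if f["content_type"] == content_type]


def cosine(a, b):
    """Cosine similarity between two vectors (0.0 if either has zero length)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def vector_search(files, query_vec, top_k=2):
    """Vector retrieval: rank files by similarity to the query embedding."""
    ranked = sorted(files, key=lambda f: cosine(f["vec"], query_vec), reverse=True)
    return [f["name"] for f in ranked[:top_k]]


exact = scalar_search(FILES, "image/jpeg")
similar = vector_search(FILES, [1.0, 0.0, 0.0])
```

The scalar query returns only files whose attribute matches exactly, while the vector query returns the closest files even when no attribute matches, which is why it tolerates fuzzy input.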

Application Scenarios

Image Resource Management: Quickly locate needed image materials
Photo Album Applications: Classify photos by people, places, and objects
E-commerce Platforms: Intelligent classification and search of product images
Media Asset Management: Intelligent tagging and retrieval of video materials
Content Creation: Quickly find materials that meet requirements

Preparation

Before using Intelligent Search, you need to enable the Cloud Infinite capability and create a dataset.

Step 1: Enable Cloud Infinite Capability

On the CloudBase Cloud Storage page, click "Advanced Capabilities" → "Intelligent Search"
Alternatively, go directly to the COS Console Intelligent Search Page. In the page address:
- Replace {YourBucketName} with your bucket name (e.g., 36c8-static-lowcode-0gwpl9v4125156ef-1258057692)
- Replace {YourRegion} with your region (e.g., ap-shanghai, ap-guangzhou)
Follow the on-page prompts to enable the Cloud Infinite Intelligent Search capability

Step 2: Create Dataset

⚠️ Notice

Currently, only the basic information template dataset that is created when you enable Cloud Infinite is supported. You cannot create additional datasets for a cloud storage bucket on your own.

Log in to COS Console
Select "Intelligent Search" → "Dataset Management" in the left navigation bar
Click the "Create Dataset" button
Configure dataset information:
- Dataset Name: Custom dataset name
- Dataset Description: Optional, describe the purpose of the dataset
- Associated Bucket: Select the COS bucket corresponding to your CloudBase cloud storage
- File Path: Specify the file path to be searched (either all files or a specified prefix)
Click "Confirm" to complete creation
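The "File Path" setting above scopes a dataset to either the whole bucket or a key prefix. A minimal sketch of that scoping rule, assuming plain string-prefix matching on object keys (the function name and sample keys are hypothetical):

```python
def in_dataset_scope(object_key, prefix=""):
    """Return True if the object key falls under the configured path prefix.

    An empty prefix means every object in the bucket is in scope.
    """
    return object_key.startswith(prefix)


keys = ["images/products/shoe.jpg", "images/banner.png", "docs/manual.pdf"]
scoped = [k for k in keys if in_dataset_scope(k, "images/")]
```

Ending the prefix with a slash (for example `images/`) avoids accidentally matching sibling paths such as `images-old/`.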

💰 Billing Information

Intelligent Search is billed based on actual usage, including:

Dataset Storage: Charged by the number of files in the dataset and storage duration
Vector Index: Charged by the number of files with vector indexes built
Retrieval Requests: Charged by the number of retrieval requests

For detailed pricing, please refer to: Cloud Infinite Billing Instructions
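The three billing items can be combined into a rough estimate. The sketch below is only an arithmetic illustration: the billing units (per file-day, per indexed file, per request) are assumptions, and the rates are placeholders rather than real prices, so take the actual figures from the Cloud Infinite Billing Instructions.

```python
def estimate_cost(file_count, storage_days, vector_indexed_files, retrieval_requests, rates):
    """Sum the three usage-based items described above.

    `rates` holds unit prices; the units used here are assumptions.
    """
    storage = file_count * storage_days * rates["per_file_day"]
    vector = vector_indexed_files * rates["per_vector_indexed_file"]
    requests = retrieval_requests * rates["per_request"]
    return {
        "storage": storage,
        "vector_index": vector,
        "requests": requests,
        "total": storage + vector + requests,
    }


# Placeholder rates for illustration only; these are NOT real prices.
PLACEHOLDER_RATES = {
    "per_file_day": 0.001,
    "per_vector_indexed_file": 0.01,
    "per_request": 0.0005,
}

estimate = estimate_cost(
    file_count=1000,
    storage_days=30,
    vector_indexed_files=500,
    retrieval_requests=2000,
    rates=PLACEHOLDER_RATES,
)
```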

Step 3: Wait for Metadata Index Completion

After creating the dataset, you need to build indexes for files:

In the dataset list, click the dataset name to enter the details page
Click "File Index Management"
Select the file types to be indexed:
- Image Index: Supports JPG, PNG, BMP, WebP and other formats
- Document Index: Supports PDF, Word, Excel, PPT, TXT and other formats
- Face Index: Automatically recognizes faces in images and builds a face library
Click "Start Indexing". The system then analyzes the files and builds the indexes automatically
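When preparing files for indexing, it can help to check which index type each object would fall under. The sketch below groups object keys by extension using the formats listed above. The exact extensions behind "Word", "Excel", and "PPT" are assumptions, and since the lists above end with "and other formats", the sets are not exhaustive.

```python
from pathlib import PurePosixPath

# Extensions derived from the format lists above (not exhaustive).
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".bmp", ".webp"}
# Assumed extensions for Word, Excel, and PPT documents.
DOCUMENT_EXTS = {".pdf", ".doc", ".docx", ".xls", ".xlsx", ".ppt", ".pptx", ".txt"}


def index_type_for(object_key):
    """Return "image", "document", or None for an object key."""
    ext = PurePosixPath(object_key).suffix.lower()
    if ext in IMAGE_EXTS:
        return "image"
    if ext in DOCUMENT_EXTS:
        return "document"
    return None


grouped = {}
for key in ["albums/2024/Photo.JPG", "contracts/lease.pdf", "clips/intro.mp4"]:
    grouped.setdefault(index_type_for(key), []).append(key)
```

Face indexing is not a separate file type: it runs on images, so files classified as "image" here are also the candidates for the face index.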

Indexing Time: Usually completes within 1-10 minutes after task submission, depending on file quantity and size.
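If you automate the wait, a simple polling loop with a timeout is usually enough. The sketch below is generic: `get_status` is a hypothetical callable that you supply, and the status strings "running", "done", and "failed" are assumptions, not values returned by any real API. The default timeout of 600 seconds mirrors the upper end of the 1-10 minute range.

```python
import time


def wait_for_index(get_status, timeout_s=600, interval_s=15,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll get_status() until it reports a final state or the timeout passes.

    Returns "done", "failed", or "timeout".
    """
    deadline = clock() + timeout_s
    while True:
        status = get_status()
        if status in ("done", "failed"):
            return status
        if clock() >= deadline:
            return "timeout"
        sleep(interval_s)


# Usage with a fake status source that finishes on the third check.
fake_states = iter(["running", "running", "done"])
result = wait_for_index(lambda: next(fake_states), sleep=lambda seconds: None)
```

Passing `sleep` and `clock` as parameters keeps the loop easy to test without real delays.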

Using Intelligent Search

Intelligent Search provides two methods: console operations and API calls.

Console Retrieval

The COS console provides a visual interface for two retrieval types: basic information retrieval and multi-modal hybrid retrieval.

1. Basic Information Retrieval

Suitable for queries based on file attributes such as filename, contentType, file size, etc.

Steps:

Log in to COS Console
Select "Intelligent Search" → "Data Retrieval" in the left navigation bar
Select retrieval scenario as "Basic Information Retrieval"
Configure query conditions:
- Select dataset
- Set query conditions (supports multiple condition combinations):
  - Filename: Supports exact or fuzzy matching
  - Content Type (contentType): e.g., image/jpeg, application/pdf
  - File Size: Supports range queries
  - Creation Time: Supports time range queries
  - Custom Tags: Based on custom file metadata
Advanced Settings (optional):
- Set sorting method (by time, size, etc.)
- Configure data aggregation rules
- Set number of results returned
Click "Query" to get results
To export the results, click the "Export List" button; you can view export tasks in "Task Center"

Query Condition Examples:

Query Requirement	Query Condition Setting
Find all JPG images	contentType = "image/jpeg"
Find files larger than 1MB	File size > 1048576
Find files from last 7 days	Creation time >= Current time - 7 days
Find images with "product" in filename	Filename contains "product" AND contentType starts with "image/"

Supported Operators: =, !=, >, >=, <, <=, contains, starts with, ends with, etc.

For detailed field and operator support list, please refer to: COS Field and Operator Documentation
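The query conditions and operators above can be modeled locally to check what a condition set should match before running it in the console. This is a minimal sketch that evaluates conditions against in-memory metadata. The field names, sample files, and the `matches` helper are hypothetical, and conditions are combined with AND.

```python
from datetime import datetime, timedelta, timezone

# One small function per operator from the supported list above.
OPERATORS = {
    "=": lambda a, b: a == b,
    "!=": lambda a, b: a != b,
    ">": lambda a, b: a > b,
    ">=": lambda a, b: a >= b,
    "<": lambda a, b: a < b,
    "<=": lambda a, b: a <= b,
    "contains": lambda a, b: b in a,
    "starts with": lambda a, b: a.startswith(b),
    "ends with": lambda a, b: a.endswith(b),
}


def matches(file_meta, conditions):
    """True if the file satisfies every (field, operator, value) condition."""
    return all(OPERATORS[op](file_meta[field], value) for field, op, value in conditions)


now = datetime(2024, 6, 15, tzinfo=timezone.utc)  # fixed so the example is reproducible
files = [
    {"filename": "product-01.jpg", "contentType": "image/jpeg",
     "size": 2_000_000, "created": now - timedelta(days=2)},
    {"filename": "product-sheet.pdf", "contentType": "application/pdf",
     "size": 500_000, "created": now - timedelta(days=30)},
    {"filename": "logo.png", "contentType": "image/png",
     "size": 40_000, "created": now - timedelta(days=1)},
]

# Images with "product" in the filename.
product_images = [("filename", "contains", "product"),
                  ("contentType", "starts with", "image/")]
# Files larger than 1 MB (1024 * 1024 = 1048576 bytes) from the last 7 days.
large_recent = [("size", ">", 1_048_576),
                ("created", ">=", now - timedelta(days=7))]

product_image_names = [f["filename"] for f in files if matches(f, product_images)]
large_recent_names = [f["filename"] for f in files if matches(f, large_recent)]
```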

2. Multi-modal Retrieval

⚠️ Feature Status

Multi-modal Retrieval is not available yet.

Multi-modal retrieval supports AI retrieval scenarios based on image content and face recognition, including image-to-image search, text-to-image search, and face retrieval.

Steps:

Log in to COS Console
Select "Intelligent Search" → "Data Retrieval" in the left navigation bar
Select retrieval scenario as "Multi-modal Hybrid Retrieval"
Input retrieval content:
- Text Retrieval: Enter descriptive text, e.g., "landscape photo with blue sky and white clouds", "red dress"
- Image Retrieval: Upload reference image or select image from bucket (supports both general image retrieval and face retrieval)
Advanced Settings:
- Return Quantity: Set maximum number of results returned (default 10, max 100)
- Similarity Threshold: Set minimum similarity score (0-100, recommended 60-80 for image retrieval, 70-85 for face retrieval)
- Retrieval Type: Select whether to search only images or all files
Metadata Filtering (optional):
- Further filter the retrieval results by metadata conditions
- Example: Only return images created in 2024
Click "Query" to get results
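The advanced settings above interact in a simple way: the similarity threshold drops weak matches, metadata filtering narrows what remains, and the return quantity caps the final list. The sketch below illustrates only that logic on made-up scores. The hit structure and function name are hypothetical, and the order in which the service applies these steps is an assumption.

```python
def filter_hits(hits, threshold=60, max_results=10, created_year=None):
    """Apply a similarity threshold, an optional metadata filter, and a result cap."""
    limit = max(1, min(max_results, 100))  # return quantity: default 10, max 100
    kept = [
        h for h in hits
        if h["score"] >= threshold
        and (created_year is None or h["created_year"] == created_year)
    ]
    kept.sort(key=lambda h: h["score"], reverse=True)  # most similar first
    return kept[:limit]


# Made-up results with similarity scores on the 0-100 scale.
hits = [
    {"key": "a.jpg", "score": 91, "created_year": 2024},
    {"key": "b.jpg", "score": 72, "created_year": 2023},
    {"key": "c.jpg", "score": 65, "created_year": 2024},
    {"key": "d.jpg", "score": 40, "created_year": 2024},
]

only_2024 = [h["key"] for h in filter_hits(hits, threshold=60, created_year=2024)]
best_match = [h["key"] for h in filter_hits(hits, threshold=70, max_results=1)]
```

Raising the threshold trades recall for precision, which is why a higher range is recommended for face retrieval than for general image retrieval.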

Multi-modal Retrieval Examples:

Image-to-Image Search:

Upload a "beach landscape" image
Set similarity threshold to 70
System returns all photos with similar content to the image

Text-to-Image Search:

Enter text description: "bride in white wedding dress"
Set return quantity to 20
System returns semantically matched images

Face Retrieval:

Upload a clear face photo
Set similarity threshold to 75
System returns all photos containing that person
Results can be sorted by time or similarity

💡 Retrieval Optimization Tips

Image Retrieval: Use clear images with distinct subjects, recommended similarity threshold 60-80
Face Retrieval: Use clear, frontal face photos as reference images, recommended similarity threshold 70-85
Index Configuration: Face retrieval requires face index to be configured in the dataset in advance

API Calls

Intelligent Search supports API calls that can be integrated into your application.

📚 API Reference

For complete API usage and parameters, please refer to: Data Retrieval API Documentation
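As a rough idea of what an integration might send, the sketch below builds a JSON request body for a basic information query (JPG images larger than 1 MB). Every key name and operation value here (`DatasetName`, `Query`, `Operation`, `SubQueries`, `Field`, `Value`, `MaxResults`, `eq`, `gt`, `and`) is an assumption for illustration; confirm the real schema, endpoint, and request signing in the Data Retrieval API Documentation before use. No request is sent.

```python
import json


def build_simple_query(dataset_name, content_type, min_size_bytes, max_results=100):
    """Build a hypothetical query body; all key names are assumptions."""
    return {
        "DatasetName": dataset_name,
        "Query": {
            "Operation": "and",  # combine the sub-conditions below
            "SubQueries": [
                {"Operation": "eq", "Field": "ContentType", "Value": content_type},
                {"Operation": "gt", "Field": "Size", "Value": str(min_size_bytes)},
            ],
        },
        "MaxResults": max_results,
    }


# "my-dataset" is a placeholder dataset name.
body = build_simple_query("my-dataset", "image/jpeg", 1_048_576)
payload = json.dumps(body, indent=2)
```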

FAQ

How long does indexing take?

Usually completed within 1-5 minutes after file upload, depending on file size and quantity.

What file formats are supported?

Images: JPG, PNG, BMP, GIF, WebP, HEIF
Videos: MP4, AVI, MOV, FLV and other mainstream formats

How to improve retrieval accuracy?

High-quality Images: Use clear images with distinct subjects
Multi-tag Combination: Combine multiple tags in a single query
Reasonable Threshold: Adjust similarity threshold appropriately for image search
Manual Review: Conduct manual secondary confirmation for important retrieval results

How to build indexes for existing files?

After enabling Intelligent Search, you need to manually trigger index building for existing files:

Select "Batch Build Index" in the console
Select directories or files to be indexed
Wait for index task to complete

How is index data stored?

Intelligent Search index data is managed by the Cloud Infinite service and does not occupy your cloud storage space. For complex queries, consider syncing tag information to a CloudBase database.
