RAG Knowledge Base: Add advanced chunking strategies

### Self Checks

- [x] I have searched for existing issues [search for existing issues](https://github.com/xpert-ai/xpert/issues), including closed ones.
- [x] [FOR CHINESE USERS] 请务必使用英文提交 Issue，谢谢！:）
- [x] Please do not modify this template :) and fill in all the required fields.

### 1. Is this request related to a challenge you're experiencing? Tell me about your story.

In the current platform, the RAG knowledge base only provides three text chunking options. We suggest adding mainstream advanced strategies, especially Semantic Chunking, which uses embeddings to measure similarity between adjacent sentences/paragraphs and only splits when the topic actually changes. This keeps chunks more coherent and reduces context fragmentation caused by fixed or rule-only splitting.
We also recommend adding QA-based Chunking / QA Augmentation: during indexing, let an LLM read document sections and generate 3–5 hypothetical questions per section, then store these questions as retrieval anchors. At query time, user questions are matched against these generated QAs, which can significantly improve retrieval precision and intent matching, especially for complex or indirect queries.

### 2. Additional context or comments

As shown in the screenshot:
<img width="1890" height="1482" alt="Image" src="https://github.com/user-attachments/assets/d2e61fe3-ccb4-4bc6-91b0-f5f63e12cff5" />

### 3. Can you help us with this feature?

- [x] I am interested in contributing to this feature.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RAG Knowledge Base: Add advanced chunking strategies #421

Self Checks

1. Is this request related to a challenge you're experiencing? Tell me about your story.

2. Additional context or comments

3. Can you help us with this feature?

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

RAG Knowledge Base: Add advanced chunking strategies #421

Description

Self Checks

1. Is this request related to a challenge you're experiencing? Tell me about your story.

2. Additional context or comments

3. Can you help us with this feature?

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions