[Research Project] MOEB: Massive Omni Embedding Benchmark

## Goals

* **Expand Scope:** Unify and expand embedding evaluation across text, image, audio, and video, including cross-modal settings and new applications.
* **Improve Quality:** Measure capabilities better through improved datasets, robustness evaluations, and harder benchmarks.
* **Increase Efficiency:** Develop faster and more informative evaluation methodologies and infrastructure.
* **Strengthen Governance:** Improve maintainability, reproducibility, trust, and open benchmark practices.


## Tracks

### 1. Modality Expansion (MOEB)

* Unify MMTEB, MIEB, MAEB, and MVEB under an omni-modal benchmark
* Add important missing models and datasets to MTEB, MIEB, MAEB, and MVEB
* Expand cross-modal evaluations for missing combinations

### 2. Quality

* Refresh saturated benchmarks.
* Improve dataset quality and filtering.
* Harder and more robust evaluations.
* Contamination: https://github.com/embeddings-benchmark/mteb/issues/1636
* Better measurement: BrowseComp, MTEB-gym

### 3. Efficiency & Methodology

* Faster evaluation pipelines and infrastructure.
* Informative task selection.
* Benchmark compression.
* IRT and optimal experiment design.
* Partial-score estimation.

### 4. Governance

* Benchmark standards.
* Reproducibility guarantees.
* Fairness and trust considerations.
* Ensure Maintainability
* related: https://arxiv.org/html/2506.21182v1, https://github.com/embeddings-benchmark/mteb/issues/4369

### 5. Human Annotations Baselines[Optional]:

* Add Human Baselines to help us categorize problematic datasets and understand score reliability.
* related https://arxiv.org/abs/2510.10062

## Ideas to be Built on Top of MOEB

* AutoResearch with Sentence Transformers.
* Explore new domains (e.g. RAG, agents). 


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Research Project] MOEB: Massive Omni Embedding Benchmark #4842

Goals

Tracks

1. Modality Expansion (MOEB)

2. Quality

3. Efficiency & Methodology

4. Governance

5. Human Annotations Baselines[Optional]:

Ideas to be Built on Top of MOEB

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

[Research Project] MOEB: Massive Omni Embedding Benchmark #4842

Description

Goals

Tracks

1. Modality Expansion (MOEB)

2. Quality

3. Efficiency & Methodology

4. Governance

5. Human Annotations Baselines[Optional]:

Ideas to be Built on Top of MOEB

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions