Skip to content

add private resullts for Vultur#587

Open
Samoed wants to merge 1 commit into
mainfrom
private_vultr
Open

add private resullts for Vultur#587
Samoed wants to merge 1 commit into
mainfrom
private_vultr

Conversation

@Samoed

@Samoed Samoed commented Jul 3, 2026

Copy link
Copy Markdown
Member

Checklist

  • My model has a model sheet, report, or similar
  • My model has a reference implementation in mteb/models/model_implementations/, this can be as an API. Instruction on how to add a model can be found here
    • No, but there is an existing PR ___
  • The results submitted are obtained using the reference implementation
  • My model is available, either as a publicly accessible API or publicly on e.g., Huggingface
  • I solemnly swear that for all results submitted I have not trained on the evaluation dataset including training splits. If I have, I have disclosed it clearly.

@github-actions

github-actions Bot commented Jul 3, 2026

Copy link
Copy Markdown

Model Results Comparison

Reference models: intfloat/multilingual-e5-large, google/gemini-embedding-001
New models evaluated: vultr/VultronRetrieverCore-Qwen3.5-4.5B, vultr/VultronRetrieverPrime-Qwen3.5-8B

Results for vultr/VultronRetrieverCore-Qwen3.5-4.5B

task_name vultr/VultronRetrieverCore-Qwen3.5-4.5B Max result Model with max result In Training Data
Vidore3NuclearRetrieval .549 .538 nvidia/nemotron-colembed-vl-8b-v2 False
Vidore3TelecomRetrieval .711 .720 nvidia/nemotron-colembed-vl-8b-v2 False
Average .630 .629 nan -

Model have high performance on these tasks: Vidore3NuclearRetrieval


Results for vultr/VultronRetrieverPrime-Qwen3.5-8B

task_name vultr/VultronRetrieverPrime-Qwen3.5-8B Max result Model with max result In Training Data
Vidore3NuclearRetrieval .536 .538 nvidia/nemotron-colembed-vl-8b-v2 False
Vidore3TelecomRetrieval .713 .720 nvidia/nemotron-colembed-vl-8b-v2 False
Average .624 .629 nan -

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant