New managed AI models

25 February, 2026

Joshua Hartmann

Systems Engineer

Joshua hat im Sommer 2023 seine Ausbildung zum Fachinformatiker für Systemintegration bei den NETWAYS Web Services erfolgreich abgeschlossen. Heute ist er ein wichtiger Teil des Teams, das sich mit großer Hingabe um die Kundenbetreuung und die kontinuierliche Weiterentwicklung der SaaS-Apps kümmert. Neben seinem musikalischen Talent am Klavier hat Joshua eine Leidenschaft für Wintersport und findet auch Freude im Gaming. Doch am allerliebsten verbringt er seine Zeit mit seiner besseren Hälfte, denn sie ist für ihn das größte Glück.

by Joshua Hartmann | Feb 25, 2026

AI Blog

We are now running two of the strongest multilingual open source retrieval models 2025/26 fully managed via our API. This adds two important components to the NWS Managed AI Models.

Anyone who has ever worked with retrieval augmented generation (RAG), semantic search or modern AI applications such as chatbots and knowledge bases has probably come across embedding models and rerankers. These models are essential for converting texts into vectorized representations that machines can efficiently compare and retrieve relevant information.

Today we introduce you to two new Managed AI models in our portfolio.

BAAI bge-m3 and bge-reranker-v2-m3 are live!

BAAI/bge-m3 is currently the most popular multilingual embedding model
BAAI/bge-reranker-v2-m3 offers a very strong price-performance ratio when reranking

bge-m3: The Swiss army knife among embeddings

Why so many people are currently so enthusiastic about it:

Supports over 100 languages at a very high level (incl. German, French, Spanish, Arabic, Chinese, Japanese, …)
Context length up to 8192 tokens, perfect for long documents, manuals, legal texts, research papers
Supports dense retrieval, multi-vector retrieval and sparse retrieval

Typical use cases:

Semantic & hybrid search
RAG of any size
Mehrsprachige Wissensdatenbanken

bge-reranker-v2-m3: The quality booster for your top-k results

A good retriever alone is often no longer enough. The Reranker sorts your top 20 or top 50 candidates much more precisely and this is exactly where this model shines.

Why is the model so popular?

State-of-the-art in multilingual reranking
Significantly more precise than bi-encoder alone
Fast processing

Typical use cases:

Two-stage retrieval
Significantly improving RAG quality
Question-answer systems, support & internal search

Integration made easy

The models are OpenAI-compatible. This means that you can use them almost anywhere where you already use embeddings or reranking or would like to use them in the future.

Well-known frameworks & tools that work out of the box:

LangChain / LlamaIndex
Haystack
n8n
Open-WebUI
LM Studio

Quickstart with NWS Managed AI Models

You can integrate our newest models into your existing AI workflows in just 3 steps.

Step 1: Configure the API URL

The base URL for the API endpoints of both models is https://api.ai.nws.netways.de. The required slugs for both models are then as follows:

For embedding: /v1/embeddings
For reranking: /rerank

Step 2: Specify model ID

For communication via the OpenAI-compatible API endpoints, you must specify the ID of the desired model in addition to the API URL. These are as follows:

Embeddings: BAAI/bge-m3
Reranking: BAAI/bge-reranker-v2-m3

Step 3: Store the API key and get started!

Now you just need to generate an API key in MyNWS or configure an existing key. You can already use our latest Managed AI models.

With bge-m3 and bge-reranker-v2-m3 you have one of the currently most powerful, flexible and at the same time most affordable open source retrieval stacks at your fingertips, without having to worry about model hosting, quantization or GPU availability.

Whether you are building a multilingual chatbot, an internal search or a really good RAG system: this combination will deliver state-of-the-art quality with manageable effort in 2026.

Have fun trying them out!

Our portfolio

0 Comments

Submit a Comment Cancel reply

How did you like our article?