← Back to Innovations
Code & Software Engineering Model Language Experimental

harrier-oss-v1

Open-Source Multilingual Text Embeddings

20,333,606 USERS
Try harrier-oss-v1 on Microsoft Foundry → Try on Microsoft Foundry →
harrier-oss-v1

About harrier-oss-v1

harrier-oss-v1 is an open-source family of multilingual text-embedding models from Microsoft that delivers strong retrieval and semantic-understanding quality across 94 languages. The models use a decoder-only architecture with last-token pooling and L2 normalization, and they accept natural-language instructions so that the same checkpoint can be specialized for retrieval, classification, clustering, or similarity tasks. Three sizes — 270M, 0.6B, and 27B parameters — are released, covering the full latency-versus-accuracy spectrum from on-device retrieval to high-throughput server workloads.

harrier-oss-v1 addresses the production need for embeddings that combine wide language coverage with deployment flexibility, eliminating the common pattern of stitching together language-specific or task-specific models. Open weights at multiple scales let teams pick a configuration matched to their latency and cost constraints while preserving consistent semantic quality across the world’s languages. The release is part of Microsoft’s broader effort to put production-grade open models — alongside Phi and BitNet — into developers’ hands so that high-quality multilingual retrieval is no longer the exclusive territory of closed APIs.

Key capabilities

  • 94 languages supported with instruction tuning
  • Decoder-only architecture with last-token pooling and L2 normalization
  • Three sizes — 270M, 0.6B, and 27B parameters
  • SOTA multilingual retrieval and semantic understanding
  • Built on the Qwen3 backbone and released as open weights
Technology Stack
PyTorch Sentence Transformers Qwen3 backbone
Technology Stack
PyTorch Sentence Transformers Qwen3 backbone

Ready to Explore?

Dive into platform integrations, source code, research papers, and announcements.