Open source LLMs in comparison
The right LLM solution for your project
Discover leading open source language models for production use: whether DeepThink reasoning, efficient inference or powerful distillation – we present a curated selection of LLMs that we at partimus actively consider in AI consulting, inference solutions and GPU configuration. With this overview, we help you find the right model for your application – be it complex reasoning, efficient inference, agentic use cases or research. Of course, we are also happy to advise you individually on the use of AI in your company.
- Latest language models
- Open source licenses
- Many available on Hugging Face
A direct overview of the most important AI language models
Large language models are the engine of modern AI – but not all LLMs are the same. Architecture, training data, license and specialization determine which model suits your project. Whether DeepSeek, LLaMA or Qwen: each model has its own strengths and requirements. If you know these differences, you can develop in a more targeted way, scale more efficiently and create real added value – from prototype to production solution. We show you what is possible with current language models and support you in making the best choice for your project.
Get started with partimus
Take the opportunity to optimize your IT infrastructure and drive your business forward – contact us today to find out more about the tailored benefits partimus can offer you.
Interesting facts about AI language models
We are your competent partner in the constantly changing landscape of AI language models. We not only answer your questions about DeepSeek, LLaMA, Qwen and the like, but also advise you individually on the best language models for you and your project. We look forward to your projects and questions – feel free to contact us by phone, e-mail or via our contact form.
How do I find out which language model suits my use case?
The choice of the right model depends heavily on the intended use – whether chatbot, research, text creation or coding. Our AI experts support you in matching the architecture, size, license and performance of a model with your requirements – for maximum efficiency and minimum effort.
What are the differences between similar models such as Qwen and LLaMA?
Even similarly sized models differ significantly in terms of license, training data, tool usage or strengths such as coding, multilingualism or reasoning. We provide independent advice and help you to compare models sensibly and choose the right setup.
Can I start with a smaller model and scale up later?
Absolutely. Many models such as Qwen3-8B or LLaMA-3-8B offer high performance with low resource requirements – ideal for initial prototypes. We support you in setting up a scalable architecture that can be easily expanded as your requirements grow.
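Why a smaller model is a good starting point can be seen with a back-of-envelope calculation: weight memory is roughly the parameter count times the bytes per parameter, which depends on the quantization level. The sketch below is illustrative only – real deployments also need headroom for activations and the KV cache.

```python
# Rough lower-bound estimate of GPU memory needed for model weights.
# Activations, KV cache and framework overhead come on top.

def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate weight memory in GB at a given quantization level."""
    return num_params * bits_per_param / 8 / 1e9

# An 8B-parameter model (e.g. Qwen3-8B or LLaMA-3-8B):
fp16 = weight_memory_gb(8e9, 16)  # 16 GB - needs a larger GPU
int4 = weight_memory_gb(8e9, 4)   # 4 GB  - fits on many consumer GPUs

print(f"8B @ fp16: {fp16:.0f} GB, 8B @ int4: {int4:.0f} GB")
# A 70B model at fp16 needs ~140 GB and therefore a multi-GPU setup:
print(f"70B @ fp16: {weight_memory_gb(70e9, 16):.0f} GB")
```

The same formula makes the scaling path visible: a quantized 8B prototype runs on modest hardware, and moving to a larger model later is mainly a question of provisioning more GPU memory.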
What are the benefits of combining model selection and hosting with partimus?
You receive everything from a single source: sound model consulting, technical integration, high-performance GPU infrastructure and GDPR-compliant hosting in Germany. This saves you time and money and avoids interface problems – ideal for fast and secure project implementation.
How does the model architecture affect performance and hardware requirements?
The architecture influences how efficiently a model calculates, how well it scales and what hardware is required. Our consulting services help you to select models that match your existing infrastructure – or we can provide suitable resources via our GPU Cloud.
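One concrete example of how architecture drives hardware requirements is the KV cache, whose size grows with the number of layers, the number of key/value heads (reduced by grouped-query attention) and the context length. The configuration values below are illustrative assumptions in the style of LLaMA-3-8B, not vendor specifications.

```python
# KV-cache memory estimate: for each token, every layer stores one key and
# one value vector per KV head. Grouped-query attention (fewer KV heads than
# query heads) shrinks this cache considerably.

def kv_cache_gb(layers: int, kv_heads: int, head_dim: int,
                seq_len: int, bytes_per_val: int = 2, batch: int = 1) -> float:
    """2 (K and V) x layers x kv_heads x head_dim x tokens x bytes, in GB."""
    return 2 * layers * kv_heads * head_dim * seq_len * bytes_per_val * batch / 1e9

# Assumed LLaMA-3-8B-style config: 32 layers, 8 KV heads (GQA), head_dim 128
print(f"8k context:  {kv_cache_gb(32, 8, 128, 8192):.2f} GB")
print(f"32k context: {kv_cache_gb(32, 8, 128, 32768):.2f} GB")
```

Doubling the context length doubles the cache, and serving many concurrent users multiplies it by the batch size – which is why long-context or high-throughput workloads often dictate the GPU choice more than the raw weight size does.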