Home / News / Stock market / Alibaba unveils Qwen3, challenges industry leaders

Stock market

Alibaba unveils Qwen3, challenges industry leaders

Investing | Tue, Apr 29 2025 08:29 AM AEST

stock

Image Source:

Alibaba unveils Qwen3, challenges industry leaders

Investing.com -- Alibaba Group Holdings Ltd ADR (NYSE:BABA) introduced its next-generation large language model series, Qwen3, on Tuesday, expanding its AI offerings across a range of model sizes and architectures. The release includes eight open-weight models, six dense and two mixture-of-experts (MoE), ranging from 0.6 billion to 235 billion parameters.

The flagship, Qwen3-235B-A22B, has demonstrated competitive performance in benchmarks across coding, mathematics, and general tasks when compared to leading models such as DeepSeek-R1, Grok-3, and Gemini-2.5-Pro. Smaller models like Qwen3-30B-A3B also outpaced more parameter-intensive models, indicating efficiency gains in structure and training.

All models—including pre-trained and post-trained variants—are publicly accessible via Hugging Face, ModelScope, and Kaggle. For deployment, Alibaba recommends SGLang and vLLM, while local users can run Qwen3 using tools like LMStudio, llama.cpp, and KTransformers.

Qwen3 offers scalable and adaptive performance, letting users tailor computational reasoning budgets to balance accuracy and resource cost. This flexibility aims to meet the increasingly diverse demands of developers integrating AI into consumer or enterprise-level workflows.

The models support 119 languages and dialects, tripling the coverage of their predecessor, Qwen2.5. This broad multilingual capability positions Qwen3 for adoption in global markets, including emerging regions with rich linguistic diversity.

Qwen3 models exhibit advances in coding and agentic functions, enhanced with deeper integration for model-conditional prompting (MCP). These refinements support sophisticated applications, such as autonomous agents and developer tooling with higher precision.

The series is trained on 36 trillion tokens, including high-quality sources from STEM, reasoning, books, and synthetic datasets. The data upgrade contributes to notable gains in language understanding, programming proficiency, and long-context memory.

Qwen3 employs architectural and training innovations such as qk layernorm and global-batch load balancing for MoE models. This leads to greater training stability and consistent performance improvements across model scales.

Its three-stage pretraining approach targets language comprehension, reasoning, and long-context processing separately, with token sequences extended up to 32,000. This modular strategy enhances Qwen3’s ability to handle complex, multi-turn interactions and larger documents.

With optimized hyperparameters guided by scaling laws for each model type, Qwen3 represents Alibaba’s most deliberate and technically comprehensive release to date. Industry observers say its open-weight strategy and multilingual reach could make it a significant contender in the global AI race.

This article first appeared in Investing.com

More For You

Stock Market

Alibaba unveils Qwen3, challenges industry leaders

Investing | Tue, Apr 29 2025 08:29 AM AEST

stock

Investing.com -- Alibaba Group Holdings Ltd ADR (NYSE:BABA) introduced...

Stock Market

Boeing outlook remains negative as S&P cites recovery risks

Investing | Tue, Apr 29 2025 08:05 AM AEST

stock

Investing.com -- S&P Global (NYSE:SPGI) Ratings affirmed Boeing Co ’...

Stock Market

TSX gains lightly on Canada’s election day

Investing | Tue, Apr 29 2025 07:53 AM AEST

stock

Tech giant NVIDIA Corporation (NASDAQ:NVDA) showed losses of 4.3% in t...