Huawei Introduces Pangu Ultra MoE: A 718B-Parameter Sparse Language Model Trained Efficiently on Ascend NPUs Using Simulation-Driven Architecture and System-Level Optimization

Sparse large language models (LLMs) based on the Mixture of Experts (MoE) framework have gained traction for their ability to scale efficiently by activating only a subset of parameters per token. This dynamic sparsity lets MoE models grow their total parameter count while keeping the compute cost of each token roughly constant.
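To make the per-token sparsity concrete, the following is a minimal sketch of top-k expert routing, the generic mechanism behind MoE layers. The function names, shapes, and the plain-NumPy implementation are illustrative assumptions for exposition, not Pangu Ultra MoE's actual routing code or its Ascend NPU kernels.

```python
import numpy as np

def topk_moe_forward(x, gate_w, expert_ws, k=2):
    """Route each token to its top-k experts and mix their outputs.

    x:          (num_tokens, d_model) token activations
    gate_w:     (d_model, num_experts) router weights
    expert_ws:  list of (d_model, d_model) per-expert weight matrices
    k:          number of experts activated per token
    """
    logits = x @ gate_w                                   # (tokens, experts)
    topk_idx = np.argsort(logits, axis=-1)[:, -k:]        # indices of the k highest-scoring experts
    # Softmax over only the selected experts' logits.
    topk_logits = np.take_along_axis(logits, topk_idx, axis=-1)
    weights = np.exp(topk_logits - topk_logits.max(-1, keepdims=True))
    weights /= weights.sum(-1, keepdims=True)

    # Each token touches only k expert weight matrices, not all of them:
    # this is the "subset of parameters per token" the article refers to.
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        for j in range(k):
            e = topk_idx[t, j]
            out[t] += weights[t, j] * (x[t] @ expert_ws[e])
    return out

# Toy usage: 4 tokens, 8-dim model, 16 experts, 2 active per token.
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
gate_w = rng.normal(size=(8, 16))
expert_ws = [rng.normal(size=(8, 8)) for _ in range(16)]
y = topk_moe_forward(x, gate_w, expert_ws, k=2)
print(y.shape)  # (4, 8)
```

In this toy setup, increasing the number of experts grows the total parameter count, but each token still multiplies against only k expert matrices, which is why MoE models can scale capacity without a proportional rise in per-token compute.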
