• Neural Pulse
  • Posts
  • Chinese AI Models Surpass US in Global Downloads

Chinese AI Models Surpass US in Global Downloads

A comprehensive analysis of 851,000 models from June 2020 to August 2025...


Hey there 👋

We hope you're excited to discover what's new and trending in AI, ML, and data science this week.

Here is your 5-minute pulse...

print("News & Trends")

Image source: Data Provenance Initiative

The Hugging Face Model Hub has transformed into a global nexus for open-weight AI models, hosting over 2 million models with 1.7 billion downloads. A comprehensive analysis of 851,000 models from June 2020 to August 2025 reveals a significant shift: U.S. industry leaders like Google, Meta, and OpenAI have seen their dominance wane, while unaffiliated developers and Chinese firms such as DeepSeek and Qwen are gaining ground. Notably, the average model size has surged 17-fold, and there's been a marked rise in multimodal generation, quantization, and mixture-of-experts architectures. However, data transparency is on the decline, with open-weight models surpassing truly open-source ones for the first time in 2025. This study underscores the evolving dynamics of power and participation in the open AI model ecosystem.

Image source: Jerry Liy

Jerry Liu introduces LlamaSheets, an API that transforms complex Excel sheets into well-structured 2D tables, enabling Claude to process them more effectively. This advancement allows for seamless integration with Pandas and SQL, enhancing data analysis capabilities. A comprehensive guide is available for implementing LlamaSheets with coding agents.

Image source: Microsoft

Microsoft introduces Fara-7B, a compact 7-billion parameter agentic model designed for computer use. Utilizing FaraGen, a synthetic data generation system, Fara-7B learns from diverse, multi-step web tasks. It processes computer interfaces through screenshots and executes actions via predicted coordinates, enabling on-device deployment. Despite its smaller size, Fara-7B outperforms similar models and competes with larger ones on benchmarks like WebVoyager and WebTailBench. Microsoft has released Fara-7B's open weights on Microsoft Foundry and Hugging Face, along with the WebTailBench benchmark.

Image source: Sonar

Sonar's latest report delves into the unique "coding personalities" of top LLMs, revealing that while these models excel at generating syntactically correct code and translating between languages, they also share significant flaws, such as introducing high-severity vulnerabilities and producing hard-to-maintain code. The study emphasizes the need for a "trust but verify" approach, highlighting that newer models may improve performance benchmarks but can also increase the risk of severe bugs.

print("Applications & Insights")

The Pragmatic Guide to Federated AI: Building Compliant LLM/XGBoost Pipelines for Sensitive Data (5 min. read)
This article delves into constructing federated AI pipelines that integrate Large Language Models (LLMs) and XGBoost, focusing on handling sensitive data while ensuring compliance. It offers practical strategies for data scientists and ML engineers to implement federated learning frameworks that maintain data privacy and adhere to regulatory standards. The guide emphasizes the importance of balancing model performance with stringent data protection measures, providing insights into the challenges and solutions in deploying compliant AI systems.

Claude 4.5 Opus' Soul Document (38 min. read)
This article delves into the "soul document" of Claude 4.5 Opus, offering a rare glimpse into the guiding principles and internal directives shaping this AI model's behavior. It explores how these foundational instructions influence Claude's decision-making processes, ethical considerations, and alignment with human values. For data scientists and ML engineers, this piece provides valuable insights into the complexities of AI alignment and the challenges of embedding ethical frameworks within advanced language models.

Context Plumbing (5 min. read)
Matt Webb delves into the intricacies of building AI systems that seamlessly interpret user intent by dynamically managing context. He emphasizes the importance of minimizing the gap between user desires and system responses, highlighting the role of 'context engineering' in providing AI with timely, relevant information. Webb likens this process to 'plumbing,' where the challenge lies in efficiently channeling ever-changing contextual data to where it's needed, ensuring AI agents can act swiftly and accurately.

The End of the Train-Test Split (5 min. read)
This article explores the limitations of traditional train-test splits in complex classification tasks, particularly those requiring large language models (LLMs). It highlights challenges in policy-driven classifications, such as detecting sexually suggestive content, where ambiguous guidelines and inconsistent labeling hinder model performance. The author suggests that instead of relying on extensive training datasets, providing LLMs with clear, natural language rules and a few high-quality examples can lead to better outcomes.

print("Tools & Resources")

TRENDING MODELS

Text-to-Image
Tongyi-MAI/Z-Image-Turbo
⇧ 86.6k Downloads
Z-Image-Turbo is a text-to-image model designed to generate high-quality images from textual descriptions.

Image-to-Image
black-forest-labs/FLUX.2-dev
⇧ 181k Downloads
FLUX.2-dev is an image-to-image model that enables advanced image transformations and enhancements.

Text Generation
deepseek-ai/DeepSeek-Math-V2
⇧ 5.73k Downloads
DeepSeek-Math-V2 is a text generation model optimized for solving complex mathematical problems and generating mathematical content.

Image-Text-to-Text
tencent/HunyuanOCR
⇧ 186k Downloads
HunyuanOCR is an image-to-text model developed by Tencent for optical character recognition, converting images of text into editable formats.

Text Generation
deepseek-ai/DeepSeek-V3.2
⇧ 2.89k Downloads
DeepSeek-V3.2 is a text generation model designed to produce coherent and contextually relevant text across various applications.

TRENDING AI TOOLS

  • 🧠 Jupyter AI: Integrates generative AI into Jupyter notebooks for enhanced interactive computing.

  • 🧮 Math v2: Advanced mathematical operations for deep learning applications.

  • 🔍 GELab-Zero-4B: First complete open-source GUI Agent with model + infrastructure

print("Everything else")

That’s it for today!

Before you go, we’d love to know what you thought of today's newsletter to help us improve the pulse experience for you.

What did you think of today's pulse?

Your feedback helps me create better emails for you!

Login or Subscribe to participate in polls.

See you soon,

Andres