Research
Qwen 3.5: Open-source Multimodal AI Models with MoE Architecture and 201 Languages
Discover Qwen 3.5, Alibaba's family of open-source AI models that rivals GPT-5 and Claude. MoE architecture,...
-
Qwen3 ASR: Open Source Multilingual Speech Recognition
9 February 2026Qwen3 ASR, a new family of automatic speech recognition models
-
PaperBanana: The AI Tool That Automates Scientific Diagram Creation
5 February 2026In the academic world, writing scientific papers represents a considerable challenge. But beyond the text, an...
-
MedGemma 1.5 and MedASR: Google Redefines Open-Source, Multimodal Medical AI
20 January 2026MedGemma 1.5 and MedASR from Google: an open-source revolution in medical AI.
-
RAG Anything : The New Era of the RAG-Modal
10 January 2026Generative artificial intelligence has reached a decisive milestone in recent years, radically transforming how we interact...
-
Meta SAM 3D: 3D Reconstruction of Images from the Physical World
12 December 2025Generative artificial intelligence is undergoing explosive acceleration. While recent years have dazzled us with models capable...
-
DS-STAR, a versatile agent : Google for data science
30 November 2025Generative artificial intelligence has already revolutionized the world of software development. Assistants like GitHub Copilot or...
-
DeepSeek-OCR : Compression context with the 2D vision
23 October 2025Large language models (LLMs) are now capable of reasoning, writing, coding, and conversing with impressive fluency....
-
FastVLM of Apple : A Model Vision Language Ultra-Efficient
10 September 2025Apple, often seen as discreet on the public artificial intelligence stage, is making a bold move...
-
ManiFlow and DiT-X: The robotic manipulation general
10 September 2025Imagine a robot capable of learning to manipulate any object, in any environment, simply by watching...
