🏢 Hugging Face Organization

21 articles · First seen: Dec 7, 2025 · Last seen: 13h ago
Articles (21)
Two Loops: How China’s Open AI Strategy Reinforces Its Industrial Dominance
The document argues that China’s “open AI” strategy—centered on open-source/open-weight models—creates reinforcing feedback loops that translate model innovation into industrial dominance. First, it d
[2106.09685] LoRA: Low-Rank Adaptation of Large Language Models
LoRA (Low-Rank Adaptation of Large Language Models) addresses the growing cost of adapting very large pretrained language models to new tasks. Instead of fine-tuning all model weights, LoRA freezes th
StephanST/unidrone · Hugging Face
Unidrone v1.0 is a set of YOLOv8-based drone detection models created from the merger of two earlier models, WALDO and NANO. WALDO was optimized for nadir, or straight-overhead, imagery, while NANO pe
mshamrai/yolov8s-visdrone · Hugging Face
The content describes the Hugging Face model mshamrai/yolov8s-visdrone, an object detection model based on YOLOv8s and trained for the VisDrone dataset. It supports detection of 10 cla
Introducing talkie: a 13B vintage language model from 1930
The article introduces talkie-1930-13b-base, a 13B “vintage” language model trained on 260 billion tokens of pre-1931 English text. Vintage LMs are designed to simulate conversations with
AI and the Future of Cybersecurity: Why Openness Matters
The article argues that AI is reshaping cybersecurity, but that the real story is not any single model like Mythos. Instead, the important breakthrough is the surrounding system: large compute, securi
deepseek-ai/DeepSeek-V4-Pro · Hugging Face
DeepSeek-V4-Pro is a preview of the DeepSeek-V4 series, a pair of large Mixture-of-Experts language models designed for highly efficient long-context AI. The two main releases are DeepSeek-V4-Pro, wit
Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign · Hugging Face
Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign is a Hugging Face text-to-speech model from the Qwen3-TTS family, released in January 2026 under the Apache-2.0 license. It is a multilingual speech generation sys
[2510.25741] Scaling Latent Reasoning via Looped Language Models
The paper titled "Scaling Latent Reasoning via Looped Language Models" introduces Ouro, a family of pre-trained Looped Language Models (LoopLM) designed to enhance reasoning capabilities during the pr
Open-Sourcing Sarvam 30B and 105B | Sarvam AI
Sarvam AI is open-sourcing two models, Sarvam 30B and Sarvam 105B, which have been developed entirely in India under the IndiaAI mission. These reasoning models were trained from scratch using extensi
[2602.06176v1] Large Language Model Reasoning Failures
The paper titled "Large Language Model Reasoning Failures," authored by Peiyang Song, Pengrui Han, and Noah Goodman, presents a comprehensive survey on the reasoning shortcomings of Large Language Mod
CL-bench Leaderboard
CL-bench is a benchmarking tool designed to assess the context learning capabilities of language models. Unlike traditional evaluations that rely on pre-trained knowledge, CL-bench requires models to
[2601.20245] How AI Impacts Skill Formation
The paper titled "How AI Impacts Skill Formation" by Judy Hanwen Shen and Alex Tamkin examines the effects of AI assistance on skill development, particularly among novice workers. The authors conduct
LTX-2: Production-Grade AI Video Generation Model | LTX Model
LTX-2 is a production-grade AI video generation model that allows users to exert precise control over motion, structure, and camera behavior, enhancing creative flexibility. It features depth-aware ge
LTX-2: Production-Grade AI Video Generation Model | LTX Model
The LTX-2 model is a production-grade AI video generation tool that allows precise control over motion, structure, and camera behavior, enabling filmmakers to create visually consistent and customized
LMArena is a cancer on AI
The article critiques LMArena, an online leaderboard for AI models, arguing that it prioritizes superficial appeal over factual accuracy. Users often vote based on presentation rather than correctness
How social media encourages the worst of AI boosterism
The recent interaction on social media between AI researchers highlights the pitfalls of AI hype and miscommunication. Demis Hassabis, CEO of Google DeepMind, publicly criticized Sébastien Bubeck from
Generative AI hype distracts us from AI’s more important breakthroughs
The article discusses how the hype surrounding generative AI, exemplified by Paul McCartney's performance with a lifelike depiction of John Lennon, distracts from more impactful advancements in predic
mlabonne/gemma-3-27b-it-abliterated · Hugging Face
The article introduces the "Gemma 3 27B IT Abliterated," an uncensored variant of the original model, designed using a novel abliteration technique that enhances its resilience compared to other model
DeepWiki
DeepWiki is an AI-driven documentation tool that enables users to interactively index and understand various code repositories. It includes notable projects such as Microsoft's Visual Studio Code, Hug
We Got Claude to Fine-Tune an Open Source LLM
The article discusses the integration of Claude with a new fine-tuning tool, Hugging Face Skills, enabling Claude to fine-tune language models autonomously. This tool allows Claude to manage tasks lik