Latest in AI

Showing:llmResearchersClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Llama 2 發布首日 24 小時動態大盤點：開源社群與 Replicate 的狂歡★ 80
Replicate Blog1,105 days agoRelease
Meta officially launched the highly anticipated open-source large language model Llama 2 on July 18, 2023, immediately triggering a tsunami of cascading…
Llama 2 正式登場！已在 Hugging Face 開源上架並全面支援生態系★ 95
Hugging Face Blog1,106 days agoRelease
Meta and Microsoft jointly announced Llama 2, a new generation of open-source large language models. Compared to the original Llama, Llama 2 increases training…
Hugging Face 的開源文本生成與 LLM 生態系全景指南★ 85
Hugging Face Blog1,107 days agoRelease
This official Hugging Face blog post systematically maps out the complete ecosystem it has built around open-source large language models (LLMs). As…
使用 Hugging Face Inference Endpoints 輕鬆部署大型語言模型 (LLM)★ 75
Hugging Face Blog1,120 days agoTutorial
This official Hugging Face blog post introduces how to use their hosted service "Inference Endpoints" to deploy large language models (LLMs). With the rapid…
Falcon 系列開源模型正式登陸 Hugging Face 生態系統★ 75
Hugging Face Blog1,149 days agoRelease
The Falcon series of large language models (including Falcon-40B and Falcon-7B), developed by Abu Dhabi's Technology Innovation Institute (TII), have…
Hugging Face 整合 bitsandbytes、4-bit 量化與 QLoRA，讓大型語言模型更親民★ 90
Hugging Face Blog1,161 days agoRelease
This official Hugging Face blog post introduces a deep integration with the `bitsandbytes` library, formally adding 4-bit quantization support to…
越小越好：Q8-Chat，在 Intel Xeon 處理器上實現高效的生成式 AI 體驗
Hugging Face Blog1,169 days agoRelease
This article introduces the latest outcome of a collaboration between Hugging Face and Intel: "Q8-Chat," a project designed to demonstrate how to efficiently…
在單一 AMD GPU 上使用 ROCm 運行類似 ChatGPT 的聊天機器人
Hugging Face Blog1,170 days agoTutorial
This blog post from Hugging Face provides a detailed walkthrough of how to deploy and run an open-source ChatGPT-like chatbot on a single AMD GPU using AMD's…
Hugging Face 推出 Assisted Generation：邁向低延遲文本生成的新方向★ 85
Hugging Face Blog1,174 days agoRelease
Large language models (LLMs) typically generate text using an "autoregressive" mechanism, meaning the model must generate one token at a time. Each generation…
使用 StarCoder 打造程式助手：StarChat Alpha 正式推出
Hugging Face Blog1,176 days agoRelease
Hugging Face has announced the launch of StarChat Alpha, a conversational AI assistant designed specifically for programming. The model is based on StarCoder…
StarCoder：開源程式碼生成 LLM 的全新里程碑★ 85
Hugging Face Blog1,181 days agoRelease
The BigCode community project, led jointly by Hugging Face and ServiceNow, has officially released StarCoder (along with its base version, StarCoderBase) — a…
Replicate 發布 2023 年 4 月開源語言模型彙整：Vicuna、Dolly v2 與 StableLM 齊聚
Replicate Blog1,194 days agoRelease
The spring of 2023 was a golden era for open-source large language model (LLM) development. In April 2023, Replicate — the well-known AI model hosting platform…
在 Habana Gaudi2 加速器上實現大型語言模型快速推理：以 BLOOMZ 為例
Hugging Face Blog1,218 days agoTutorial
This article presents the results of a collaboration between Hugging Face and the Intel Habana team, focusing on how to leverage Intel's Habana Gaudi2 deep…
Hugging Face 與 AWS 達成深度合作，降低生成式 AI 部署與訓練門檻★ 75
Hugging Face Blog1,253 days agoBusiness
Open-source AI community leader Hugging Face and cloud computing giant Amazon Web Services (AWS) have announced an expanded partnership aimed at making…
Hugging Face 推出 PEFT 庫：用更低的硬體成本高效微調大型語言模型★ 85
Hugging Face Blog1,264 days agoNew Tool
As the parameter scale of large language models (LLMs) continues to grow, full fine-tuning has become prohibitively expensive and impractical. To lower the…
遊戲開發中的 AI 應用 #5：生成故事與敘事創作
Hugging Face Blog1,267 days agoTutorial
### A New Dimension of Game Storytelling: AI-Powered Dynamic Story Generation In traditional game development, writing rich, branching narratives is an…
什麼讓對話代理（Dialog Agent）變得實用？Hugging Face 深度解析★ 75
Hugging Face Blog1,281 days agoOpinion
Amid the generative AI wave sparked by ChatGPT, Hugging Face published this in-depth article exploring how to transform "base language models" — which can only…
Hugging Face Inference Endpoints 入門指南：輕鬆部署生產級 AI 模型★ 75
Hugging Face Blog1,383 days agoTutorial
Hugging Face Inference Endpoints is a fully managed service designed for developers and enterprises, built to solve the pain points of deploying machine…
超大型語言模型及其評估方法：Hugging Face 推出 Hub 上的零樣本評估★ 75
Hugging Face Blog1,394 days agoNew Tool
In late 2022, as massive language models like BLOOM and OPT emerged one after another, the AI community faced a core pain point: how to effectively and…
Hugging Face 揭秘：🤗 Accelerate 如何藉助 PyTorch 運行超大型模型★ 80
Hugging Face Blog1,400 days agoTutorial
As the parameter counts of large language models (LLMs) grow exponentially, how to load and run these models on limited hardware has become a major pain point…
使用 DeepSpeed 與 Accelerate 實現極速 BLOOM 模型推理
Hugging Face Blog1,411 days agoTutorial
BLOOM is a massive open-source multilingual model with 176 billion parameters. Running BLOOM at FP16 precision requires at least 352 GB of video memory (VRAM)…
介紹全球最大的開源多語言大語言模型：BLOOM★ 75
Hugging Face Blog1,477 days agoRelease
In July 2022, Hugging Face and the BigScience collaborative community officially released BLOOM (BigScience Large Open-science Open-access Multilingual…
使用 PyTorch Fully Sharded Data Parallel (FSDP) 加速超大型模型訓練★ 75
Hugging Face Blog1,548 days agoRelease
As AI model scale has grown exponentially, training large models with billions of parameters has become the norm — but this also presents enormous hardware…
從頭開始訓練 CodeParrot 🦜：Hugging Face 的程式碼生成模型實戰指南
Hugging Face Blog1,693 days agoTutorial
This classic Hugging Face blog post documents the birth of the "CodeParrot" project — an experiment in training a code generation model entirely from scratch…
大型語言模型：全新版本的摩爾定律？
Hugging Face Blog1,736 days agoOpinion
In late 2021, the AI field witnessed an unprecedented explosive growth in large language models (LLMs). From OpenAI's GPT-3 at 175 billion parameters to the…

← PreviousPage 3

Latest in AI

Llama 2 發布首日 24 小時動態大盤點：開源社群與 Replicate 的狂歡★ 80

Llama 2 正式登場！已在 Hugging Face 開源上架並全面支援生態系★ 95

Hugging Face 的開源文本生成與 LLM 生態系全景指南★ 85

使用 Hugging Face Inference Endpoints 輕鬆部署大型語言模型 (LLM)★ 75

Falcon 系列開源模型正式登陸 Hugging Face 生態系統★ 75

Hugging Face 整合 bitsandbytes、4-bit 量化與 QLoRA，讓大型語言模型更親民★ 90

越小越好：Q8-Chat，在 Intel Xeon 處理器上實現高效的生成式 AI 體驗

在單一 AMD GPU 上使用 ROCm 運行類似 ChatGPT 的聊天機器人

Hugging Face 推出 Assisted Generation：邁向低延遲文本生成的新方向★ 85

使用 StarCoder 打造程式助手：StarChat Alpha 正式推出

StarCoder：開源程式碼生成 LLM 的全新里程碑★ 85

Replicate 發布 2023 年 4 月開源語言模型彙整：Vicuna、Dolly v2 與 StableLM 齊聚

在 Habana Gaudi2 加速器上實現大型語言模型快速推理：以 BLOOMZ 為例

Hugging Face 與 AWS 達成深度合作，降低生成式 AI 部署與訓練門檻★ 75

Hugging Face 推出 PEFT 庫：用更低的硬體成本高效微調大型語言模型★ 85

遊戲開發中的 AI 應用 #5：生成故事與敘事創作

什麼讓對話代理（Dialog Agent）變得實用？Hugging Face 深度解析★ 75

Hugging Face Inference Endpoints 入門指南：輕鬆部署生產級 AI 模型★ 75

超大型語言模型及其評估方法：Hugging Face 推出 Hub 上的零樣本評估★ 75

Hugging Face 揭秘：🤗 Accelerate 如何藉助 PyTorch 運行超大型模型★ 80

使用 DeepSpeed 與 Accelerate 實現極速 BLOOM 模型推理

介紹全球最大的開源多語言大語言模型：BLOOM★ 75

使用 PyTorch Fully Sharded Data Parallel (FSDP) 加速超大型模型訓練★ 75

從頭開始訓練 CodeParrot 🦜：Hugging Face 的程式碼生成模型實戰指南

大型語言模型：全新版本的摩爾定律？