Latest in AI

Showing:llmDevelopersClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

讓你的 Llama 生成速度飛起來：使用 AWS Inferentia2 進行加速★ 72
Hugging Face Blog994 days agoTutorial
As large language models (LLMs) such as Llama 2 become more widely adopted, achieving efficient and cost-effective inference in production environments has…
如何使用 API 運行 Mistral 7B 開源語言模型
Replicate Blog1,026 days agoTutorial
Mistral 7B is a milestone open-source large language model (LLM) released by the Mistral AI team in the autumn of 2023. Despite having only 7 billion…
Hugging Face Transformers 原生支援量化方案全解析：bitsandbytes 與 GPTQ 實戰指南★ 75
Hugging Face Blog1,050 days agoTutorial
As the parameter count of large language models (LLMs) has grown dramatically, running and fine-tuning these models on consumer-grade GPUs or limited hardware…
展翅高飛：擁有 1800 億參數的 Falcon 180B 正式發布★ 75
Hugging Face Blog1,056 days agoRelease
The Technology Innovation Institute (TII) in Abu Dhabi, UAE has officially released what is currently the largest openly accessible large language model on…
Meta 推出 Code Llama：基於 Llama 2 的開源程式碼生成模型，支援 100k 上下文★ 85
Hugging Face Blog1,068 days agoRelease
Meta has officially launched Code Llama, a family of state-of-the-art open-source code generation models fine-tuned on Llama 2. Code Llama achieves leading…
Replicate API 支援語言模型串流輸出，大幅提升應用程式響應速度★ 70
Replicate Blog1,079 days agoRelease
Replicate announced that its API now officially supports streaming output for language models (LLMs). This update addresses one of the most common pain points…
使用 API 輕鬆運行 Llama 2：只需一行程式碼即可在雲端部署
Replicate Blog1,097 days agoTutorial
Meta's Llama 2 represents a landmark milestone in the history of open-source large language model (LLM) development. Its performance was regarded at the time…
Llama 2 發布首日 24 小時動態大盤點：開源社群與 Replicate 的狂歡★ 80
Replicate Blog1,105 days agoRelease
Meta officially launched the highly anticipated open-source large language model Llama 2 on July 18, 2023, immediately triggering a tsunami of cascading…
Llama 2 正式登場！已在 Hugging Face 開源上架並全面支援生態系★ 95
Hugging Face Blog1,106 days agoRelease
Meta and Microsoft jointly announced Llama 2, a new generation of open-source large language models. Compared to the original Llama, Llama 2 increases training…
Hugging Face 的開源文本生成與 LLM 生態系全景指南★ 85
Hugging Face Blog1,107 days agoRelease
This official Hugging Face blog post systematically maps out the complete ecosystem it has built around open-source large language models (LLMs). As…
使用 Hugging Face Inference Endpoints 輕鬆部署大型語言模型 (LLM)★ 75
Hugging Face Blog1,120 days agoTutorial
This official Hugging Face blog post introduces how to use their hosted service "Inference Endpoints" to deploy large language models (LLMs). With the rapid…
Falcon 系列開源模型正式登陸 Hugging Face 生態系統★ 75
Hugging Face Blog1,149 days agoRelease
The Falcon series of large language models (including Falcon-40B and Falcon-7B), developed by Abu Dhabi's Technology Innovation Institute (TII), have…
Hugging Face 整合 bitsandbytes、4-bit 量化與 QLoRA，讓大型語言模型更親民★ 90
Hugging Face Blog1,161 days agoRelease
This official Hugging Face blog post introduces a deep integration with the `bitsandbytes` library, formally adding 4-bit quantization support to…
越小越好：Q8-Chat，在 Intel Xeon 處理器上實現高效的生成式 AI 體驗
Hugging Face Blog1,169 days agoRelease
This article introduces the latest outcome of a collaboration between Hugging Face and Intel: "Q8-Chat," a project designed to demonstrate how to efficiently…
在單一 AMD GPU 上使用 ROCm 運行類似 ChatGPT 的聊天機器人
Hugging Face Blog1,170 days agoTutorial
This blog post from Hugging Face provides a detailed walkthrough of how to deploy and run an open-source ChatGPT-like chatbot on a single AMD GPU using AMD's…
Hugging Face 推出 Assisted Generation：邁向低延遲文本生成的新方向★ 85
Hugging Face Blog1,174 days agoRelease
Large language models (LLMs) typically generate text using an "autoregressive" mechanism, meaning the model must generate one token at a time. Each generation…
使用 StarCoder 打造程式助手：StarChat Alpha 正式推出
Hugging Face Blog1,176 days agoRelease
Hugging Face has announced the launch of StarChat Alpha, a conversational AI assistant designed specifically for programming. The model is based on StarCoder…
StarCoder：開源程式碼生成 LLM 的全新里程碑★ 85
Hugging Face Blog1,181 days agoRelease
The BigCode community project, led jointly by Hugging Face and ServiceNow, has officially released StarCoder (along with its base version, StarCoderBase) — a…
Replicate 發布 2023 年 4 月開源語言模型彙整：Vicuna、Dolly v2 與 StableLM 齊聚
Replicate Blog1,194 days agoRelease
The spring of 2023 was a golden era for open-source large language model (LLM) development. In April 2023, Replicate — the well-known AI model hosting platform…
Replicate 宣布正式支援大型語言模型（LLMs），提供快速啟動與串流 API
Replicate Blog1,210 days agoRelease
Replicate, the well-known AI model hosting platform, has announced official support for large language models (LLMs) on its platform. Previously, Replicate was…
在 Habana Gaudi2 加速器上實現大型語言模型快速推理：以 BLOOMZ 為例
Hugging Face Blog1,218 days agoTutorial
This article presents the results of a collaboration between Hugging Face and the Intel Habana team, focusing on how to leverage Intel's Habana Gaudi2 deep…
Hugging Face 與 AWS 達成深度合作，降低生成式 AI 部署與訓練門檻★ 75
Hugging Face Blog1,253 days agoBusiness
Open-source AI community leader Hugging Face and cloud computing giant Amazon Web Services (AWS) have announced an expanded partnership aimed at making…
Hugging Face 推出 PEFT 庫：用更低的硬體成本高效微調大型語言模型★ 85
Hugging Face Blog1,264 days agoNew Tool
As the parameter scale of large language models (LLMs) continues to grow, full fine-tuning has become prohibitively expensive and impractical. To lower the…
遊戲開發中的 AI 應用 #5：生成故事與敘事創作
Hugging Face Blog1,267 days agoTutorial
### A New Dimension of Game Storytelling: AI-Powered Dynamic Story Generation In traditional game development, writing rich, branching narratives is an…
什麼讓對話代理（Dialog Agent）變得實用？Hugging Face 深度解析★ 75
Hugging Face Blog1,281 days agoOpinion
Amid the generative AI wave sparked by ChatGPT, Hugging Face published this in-depth article exploring how to transform "base language models" — which can only…
Hugging Face Inference Endpoints 入門指南：輕鬆部署生產級 AI 模型★ 75
Hugging Face Blog1,383 days agoTutorial
Hugging Face Inference Endpoints is a fully managed service designed for developers and enterprises, built to solve the pain points of deploying machine…
超大型語言模型及其評估方法：Hugging Face 推出 Hub 上的零樣本評估★ 75
Hugging Face Blog1,394 days agoNew Tool
In late 2022, as massive language models like BLOOM and OPT emerged one after another, the AI community faced a core pain point: how to effectively and…
Hugging Face 揭秘：🤗 Accelerate 如何藉助 PyTorch 運行超大型模型★ 80
Hugging Face Blog1,400 days agoTutorial
As the parameter counts of large language models (LLMs) grow exponentially, how to load and run these models on limited hardware has become a major pain point…
使用 DeepSpeed 與 Accelerate 實現極速 BLOOM 模型推理
Hugging Face Blog1,411 days agoTutorial
BLOOM is a massive open-source multilingual model with 176 billion parameters. Running BLOOM at FP16 precision requires at least 352 GB of video memory (VRAM)…
介紹全球最大的開源多語言大語言模型：BLOOM★ 75
Hugging Face Blog1,477 days agoRelease
In July 2022, Hugging Face and the BigScience collaborative community officially released BLOOM (BigScience Large Open-science Open-access Multilingual…

← PreviousPage 3Next →

Latest in AI

讓你的 Llama 生成速度飛起來：使用 AWS Inferentia2 進行加速★ 72

如何使用 API 運行 Mistral 7B 開源語言模型

Hugging Face Transformers 原生支援量化方案全解析：bitsandbytes 與 GPTQ 實戰指南★ 75

展翅高飛：擁有 1800 億參數的 Falcon 180B 正式發布★ 75

Meta 推出 Code Llama：基於 Llama 2 的開源程式碼生成模型，支援 100k 上下文★ 85

Replicate API 支援語言模型串流輸出，大幅提升應用程式響應速度★ 70

使用 API 輕鬆運行 Llama 2：只需一行程式碼即可在雲端部署

Llama 2 發布首日 24 小時動態大盤點：開源社群與 Replicate 的狂歡★ 80

Llama 2 正式登場！已在 Hugging Face 開源上架並全面支援生態系★ 95

Hugging Face 的開源文本生成與 LLM 生態系全景指南★ 85

使用 Hugging Face Inference Endpoints 輕鬆部署大型語言模型 (LLM)★ 75

Falcon 系列開源模型正式登陸 Hugging Face 生態系統★ 75

Hugging Face 整合 bitsandbytes、4-bit 量化與 QLoRA，讓大型語言模型更親民★ 90

越小越好：Q8-Chat，在 Intel Xeon 處理器上實現高效的生成式 AI 體驗

在單一 AMD GPU 上使用 ROCm 運行類似 ChatGPT 的聊天機器人

Hugging Face 推出 Assisted Generation：邁向低延遲文本生成的新方向★ 85

使用 StarCoder 打造程式助手：StarChat Alpha 正式推出

StarCoder：開源程式碼生成 LLM 的全新里程碑★ 85

Replicate 發布 2023 年 4 月開源語言模型彙整：Vicuna、Dolly v2 與 StableLM 齊聚

Replicate 宣布正式支援大型語言模型（LLMs），提供快速啟動與串流 API

在 Habana Gaudi2 加速器上實現大型語言模型快速推理：以 BLOOMZ 為例

Hugging Face 與 AWS 達成深度合作，降低生成式 AI 部署與訓練門檻★ 75

Hugging Face 推出 PEFT 庫：用更低的硬體成本高效微調大型語言模型★ 85

遊戲開發中的 AI 應用 #5：生成故事與敘事創作

什麼讓對話代理（Dialog Agent）變得實用？Hugging Face 深度解析★ 75

Hugging Face Inference Endpoints 入門指南：輕鬆部署生產級 AI 模型★ 75

超大型語言模型及其評估方法：Hugging Face 推出 Hub 上的零樣本評估★ 75

Hugging Face 揭秘：🤗 Accelerate 如何藉助 PyTorch 運行超大型模型★ 80

使用 DeepSpeed 與 Accelerate 實現極速 BLOOM 模型推理

介紹全球最大的開源多語言大語言模型：BLOOM★ 75