Latest in AI

Showing:webgpuResearchersClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

ggml-webgpu improves prefill speeds for k-quants in llama.cpp PR
r/LocalLLaMA top day49 days agoBenchmark
llama.cpp PR #24225 improves ggml-webgpu matrix multiplication performance for k-quants and refactors matmul paths for Q4/Q5/Q8 and k-quants. In pp512 tests on an M2 Pro, reported speedups range from about 1.33x to 3.78x across Q2_K, Q3_K, Q4_K, Q5_K, and Q6_K. The largest gains appear on Q3_K models, including Qwen and Gemma examples.
Transformers.js v4 正式上架 NPM！網頁端 WebGPU AI 迎來重大效能升級★ 85
Hugging Face Blog169 days agoRelease
Hugging Face officially published Transformers.js v4 on NPM, marking a major milestone for running local AI models within the JavaScript ecosystem…
Replicate Intelligence #4：探索 GPT 模型中的概念、瀏覽器即時語音轉文字與 H100 GPU 即將上線
Replicate Blog774 days agoRelease
Replicate has published its technical newsletter, Replicate Intelligence #4, summarizing recent major developments in the AI field as well as the latest…
使用 Transformers.js 開發機器學習驅動的網頁遊戲★ 75
Hugging Face Blog1,119 days agoTutorial
This official Hugging Face blog post explores in depth how to use the Transformers.js library to run machine learning (ML) models directly in the browser…