Latest in AI

Showing:parquetResearchersClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Hugging Face 推出 Parquet 內容定義分塊 (CDC)：優化大規模 AI 資料集去重與傳輸效率★ 75
Hugging Face Blog368 days agoRelease
### What Is Parquet Content-Defined Chunking (CDC)? In the AI and machine learning field, dataset sizes are growing at a staggering pace. Datasets on the…
提升 Hugging Face Hub 上的 Parquet 去重（Deduplication）效率
Hugging Face Blog661 days agoRelease
The Hugging Face Hub, as the world's largest open-source AI community and dataset hosting platform, automatically converts datasets uploaded in various formats…
DuckDB：直接分析 Hugging Face Hub 上超過 50,000 個數據集★ 75
Hugging Face Blog1,147 days agoRelease
The official Hugging Face blog has announced an important integration: developers can now use DuckDB to directly analyze and query more than 50,000 open…