Use HF Papers

This commit is contained in:
Quentin Gallouédec
2025-05-17 03:41:24 +00:00
parent 92106c8762
commit ffb8d63324
97 changed files with 113 additions and 113 deletions

View File

@ -1,7 +1,7 @@
//! Implementation of DistilBert, a distilled version of BERT.
//!
//! See:
//! - ["DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter"](https://arxiv.org/abs/1910.01108)
//! - ["DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter"](https://huggingface.co/papers/1910.01108)
//!
use super::with_tracing::{layer_norm, linear, LayerNorm, Linear};
use candle::{DType, Device, Result, Tensor};