File:Transformer-XL.png
Transformer-XL.png (555 × 431 pixels, file size: 64 KB, MIME type: image/png)
Summary
Description | English: The Transformer-XL architecture is an approach to handle long-term dependencies in sequence data. It extends the traditional Transformer model by introducing a recurrence mechanism and relative positional encoding scheme, allowing it to model longer-term dependencies more effectively. This makes it particularly useful for tasks like language modeling, where understanding the context from earlier parts of the text can be crucial for generating accurate predictions. |
Date | 2022 |
File source | https://arxiv.org/pdf/2207.06881.pdf |
Author | Aydar Bulatov,Yuri Kuratov, Mikhail S. Burtse |
Licensing
|
File history
Click on a date/time to view the file as it appeared at that time.
Date/Time | Thumbnail | Dimensions | User | Comment | |
---|---|---|---|---|---|
current | 03:32, 11 October 2023 | 555 × 431 (64 KB) | AmirhosseinAbaskohi (talk | contribs) | Uploaded a work by Aydar Bulatov,Yuri Kuratov, Mikhail S. Burtse from https://arxiv.org/pdf/2207.06881.pdf with UploadWizard |
You cannot overwrite this file.
File usage
The following page uses this file: