File:Recurrent Memoty Transformer.png

From UBC Wiki

Recurrent_Memoty_Transformer.png(791 × 343 pixels, file size: 65 KB, MIME type: image/png)

Summary

Description
English: RMT enhances the Transformer model by introducing global memory tokens, facilitating segment-level recurrence. It integrates unique read and write memory tokens into the input sequence, enabling the use of multiple memory tokens in each read/write block. Notably, the updated write memory representations are seamlessly passed to the subsequent segment, ensuring a dynamic and context-rich sequence processing. It leverages the ability of transformers to capture complex patterns within data, while also utilizing the sequential processing capabilities of recurrent networks. This allows the RMT to effectively handle tasks with long-term dependencies and complex sequential dynamics.
Date 2022(2022)
File source https://arxiv.org/pdf/2207.06881.pdf
Author Aydar Bulatov,Yuri Kuratov, Mikhail S. Burtse

Licensing

Some rights reserved
Permission is granted to copy, distribute and/or modify this document according to the terms in Creative Commons License, Attribution-ShareAlike 4.0. The full text of this license may be found here: CC by-sa 4.0
Attribution-Share-a-like

File history

Click on a date/time to view the file as it appeared at that time.

Date/TimeThumbnailDimensionsUserComment
current04:03, 11 October 2023Thumbnail for version as of 04:03, 11 October 2023791 × 343 (65 KB)AmirhosseinAbaskohi (talk | contribs)Uploaded a work by Aydar Bulatov,Yuri Kuratov, Mikhail S. Burtse from https://arxiv.org/pdf/2207.06881.pdf with UploadWizard

The following page uses this file: