RT @_akhaliq: Scaling Transformer to 1M tokens and beyond with RMT
The Recurrent Memory Transformer (RMT) retains information across up to 2 million tokens.
During inference, the model effectively utilized memory for up to 4,096 segments with a total length of 2,048,000 tokens—significantly exceeding… https://t.co/MbIegSfyb0 https://t.co/Axggo0nSoH
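A minimal sketch (not the authors' code) of the segment-level recurrence idea behind RMT: the long input is split into segments, a small set of memory tokens is prepended to each segment, and the memory outputs of one segment become the memory inputs of the next. The segment length, memory size, and backbone below are illustrative assumptions, not values from the paper.

```python
import torch
import torch.nn as nn

class RecurrentMemorySketch(nn.Module):
    def __init__(self, d_model=256, n_heads=4, n_layers=2, num_mem_tokens=8):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, n_layers)
        # Learned initial memory tokens, shared across sequences.
        self.init_memory = nn.Parameter(torch.randn(1, num_mem_tokens, d_model))
        self.num_mem_tokens = num_mem_tokens

    def forward(self, segments):
        """segments: ordered list of (batch, seg_len, d_model) tensors."""
        batch = segments[0].size(0)
        memory = self.init_memory.expand(batch, -1, -1)
        outputs = []
        for seg in segments:
            # Prepend the current memory state to the segment tokens.
            x = torch.cat([memory, seg], dim=1)
            y = self.backbone(x)
            # The positions that held memory tokens carry state to the next segment.
            memory = y[:, : self.num_mem_tokens, :]
            outputs.append(y[:, self.num_mem_tokens :, :])
        return torch.cat(outputs, dim=1), memory

# Usage: a 2,048-token input processed as 4 segments of 512 tokens each.
model = RecurrentMemorySketch()
segs = list(torch.randn(1, 2048, 256).split(512, dim=1))
out, mem = model(segs)
print(out.shape, mem.shape)  # torch.Size([1, 2048, 256]) torch.Size([1, 8, 256])
```

Because each segment is a fixed size, attention cost stays constant per step while information flows forward through the memory tokens, which is what lets the effective context grow to millions of tokens in the paper's experiments.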