CLASS_VISION Telegram 574
Forwarded from Tensorflow(@CVision) (Alireza Akhavan)
#paper

The Evolved Transformer
The authors run architecture search over the Transformer's stackable cells for seq2seq tasks. “A much smaller, mobile-friendly, Evolved Transformer with only ~7M parameters outperforms the original Transformer by 0.7 BLEU on WMT14 EN-DE.”
https://arxiv.org/abs/1901.11117
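
For readers new to architecture search, here is a toy sketch of tournament-selection evolution, the general family of method the linked paper builds on. It is not the authors' implementation: the search space (SEARCH_SPACE), the scoring function (toy_fitness) and all numeric values are hypothetical stand-ins. A real search would train each candidate model and measure its quality instead of calling toy_fitness.

```python
import random

# Hypothetical toy search space: one choice per slot of a "cell".
SEARCH_SPACE = {
    "layer": ["self_attention", "conv_3x1", "feed_forward"],
    "hidden_size": [128, 256, 512],
    "activation": ["relu", "swish", "none"],
}


def random_architecture(rng):
    """Sample one option for every slot."""
    return {slot: rng.choice(options) for slot, options in SEARCH_SPACE.items()}


def mutate(arch, rng):
    """Copy the parent and resample a single randomly chosen slot."""
    child = dict(arch)
    slot = rng.choice(list(SEARCH_SPACE))
    child[slot] = rng.choice(SEARCH_SPACE[slot])
    return child


def toy_fitness(arch):
    """Made-up score standing in for 'train the model and evaluate it'."""
    layer_score = {"self_attention": 1.0, "conv_3x1": 0.8, "feed_forward": 0.5}
    activation_score = {"relu": 0.2, "swish": 0.3, "none": 0.0}
    return (
        layer_score[arch["layer"]]
        + arch["hidden_size"] / 512
        + activation_score[arch["activation"]]
    )


def evolve(population_size=20, tournament_size=5, steps=200, seed=0):
    """Run tournament-selection evolution and return the best architecture."""
    rng = random.Random(seed)
    population = [random_architecture(rng) for _ in range(population_size)]
    for _ in range(steps):
        # Tournament: sample a few members; the fittest becomes the parent.
        contestants = rng.sample(population, tournament_size)
        parent = max(contestants, key=toy_fitness)
        child = mutate(parent, rng)
        # Keep the population size fixed by dropping the weakest member.
        population.remove(min(population, key=toy_fitness))
        population.append(child)
    return max(population, key=toy_fitness)


if __name__ == "__main__":
    best = evolve()
    print(best, toy_fitness(best))
```

Dropping the weakest member at each step is only one possible replacement rule; other variants remove the oldest member instead.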

The Evolved Transformer is twice as efficient as the Transformer in FLOPs, with no loss in quality.

#seq2seq



tgoop.com/class_vision/574

BY 🚀 Class Vision | Learning AI from basics to advanced



