"We show that carefully designed sparse attention can be as expressive and flexible as the original full attention model. Along with theoretical guarantees, we provide a very efficient implementation which allows us to scale to much longer inputs. As a consequence, we achieve state-of-the-art results...
Source: googleblog.com
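To make the sparse-attention idea concrete, here is a minimal NumPy sketch of the kind of attention pattern such models use: a sliding window plus a few global and random connections per query, in the spirit of sparse-attention models like BigBird. All names here are illustrative, and the dense boolean mask is for clarity only; the efficiency gains the excerpt describes come from block-sparse kernels that never materialize the full score matrix.

```python
import numpy as np

def sparse_attention_mask(seq_len, window=3, n_global=2, n_random=2, seed=0):
    """Boolean mask: True where query i may attend to key j.

    Combines a local sliding window, a few global tokens, and a few
    random links per row (illustrative, not any library's exact pattern).
    """
    rng = np.random.default_rng(seed)
    mask = np.zeros((seq_len, seq_len), dtype=bool)
    for i in range(seq_len):
        lo, hi = max(0, i - window), min(seq_len, i + window + 1)
        mask[i, lo:hi] = True                          # local sliding window
        mask[i, rng.choice(seq_len, n_random)] = True  # random long-range links
    mask[:n_global, :] = True                          # global tokens attend everywhere
    mask[:, :n_global] = True                          # everyone attends to them
    return mask

def masked_attention(Q, K, V, mask):
    """Standard scaled dot-product attention, restricted by `mask`."""
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    scores = np.where(mask, scores, -np.inf)           # disallowed pairs get -inf
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

seq_len, d = 16, 8
rng = np.random.default_rng(1)
Q, K, V = (rng.standard_normal((seq_len, d)) for _ in range(3))
out = masked_attention(Q, K, V, sparse_attention_mask(seq_len))
print(out.shape)  # (16, 8)
```

Since each query attends to only O(window + n_global + n_random) keys rather than all of them, the number of nonzero attention weights grows linearly with sequence length, which is what lets such models scale to much longer inputs.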
Robust Neural Machine Translation: In recent years, neural machine translation (NMT) using Transformer models has experienced tremendous success. Based on deep neural networks, NMT models are usually trained end-to-end on very large parallel corpora (input/output text pairs) in an entirely data-driven...
Source: googleblog.com
Improving Language Understanding with Unsupervised Learning: We've obtained state-of-the-art results on a suite of diverse language tasks with a scalable, task-agnostic system, which we're also releasing. Our approach is a combination of two existing ideas: transformers and unsupervised pre-training...
Source: openai.com
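The recipe in that last excerpt, generative pre-training on unlabeled text followed by supervised fine-tuning of the same network, can be sketched in a few lines of PyTorch. This is a toy under stated assumptions, not OpenAI's released code: the model size, data, and head names are hypothetical, and a causal-masked encoder stack stands in for a decoder-only transformer.

```python
import torch
import torch.nn as nn

# Tiny causal transformer LM: pre-train on next-token prediction,
# then reuse the same trunk for a downstream classification task.
class TinyTransformerLM(nn.Module):
    def __init__(self, vocab=1000, d=64, n_heads=4, n_layers=2, max_len=128):
        super().__init__()
        self.embed = nn.Embedding(vocab, d)
        self.pos = nn.Embedding(max_len, d)
        layer = nn.TransformerEncoderLayer(d, n_heads, dim_feedforward=4 * d,
                                           batch_first=True)
        self.trunk = nn.TransformerEncoder(layer, n_layers)
        self.lm_head = nn.Linear(d, vocab)   # used during pre-training
        self.cls_head = nn.Linear(d, 2)      # added for fine-tuning

    def features(self, tokens):
        T = tokens.size(1)
        h = self.embed(tokens) + self.pos(torch.arange(T, device=tokens.device))
        causal = nn.Transformer.generate_square_subsequent_mask(T)
        return self.trunk(h, mask=causal)

    def lm_loss(self, tokens):               # unsupervised objective
        logits = self.lm_head(self.features(tokens[:, :-1]))
        return nn.functional.cross_entropy(
            logits.reshape(-1, logits.size(-1)), tokens[:, 1:].reshape(-1))

    def classify(self, tokens):              # supervised fine-tuning head
        return self.cls_head(self.features(tokens)[:, -1])

model = TinyTransformerLM()
tokens = torch.randint(0, 1000, (4, 32))     # stand-in for an unlabeled batch
model.lm_loss(tokens).backward()             # 1) pre-training step
labels = torch.randint(0, 2, (4,))           # stand-in for a labeled batch
loss = nn.functional.cross_entropy(model.classify(tokens), labels)
loss.backward()                              # 2) fine-tuning step
```

The point of the sketch is the shared `trunk`: the unsupervised language-modeling step trains it on plentiful raw text, and the task-specific head then adapts those same weights with comparatively little labeled data, which is why the approach is described as scalable and task-agnostic.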