- Neural Machine Translation by Jointly Learning to Align and Translate,attention机制开山之作。2015
- Attention Is All You Need,self-attetion和Transformer。2017
- Deep contextualized word representations,ELMo。2018
- Universal Language Model Fine-tuning for Text Classification,ULMFiT。2018
- Improving Language Understanding by Generative Pre-Training,OpenAI GPT。2018
- Language Models are Unsupervised Multitask Learners,GPT-2。2019
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,2018
- Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context,2019,Transformer-xl
- XLNet: Generalized Autoregressive Pretraining for Language Understanding,2019
- RoBERTa: A Robustly Optimized BERT Pretraining Approach,2019
- cs224n,Stanford NLP
- Ask Me Anything: Dynamic Memory Networks for Natural Language Processing,2015,记忆网络之Dynamic Memory Networks,问答
- Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer,2019,T5, Google
- Bidirectional LSTM-CRF Models for Sequence Tagging,2015,
- Deep Interest Network for Click-Through Rate Prediction,推荐系统中的注意力机制——阿里深度兴趣网络DIN。Alibaba 2017
- Deep Neural Networks for YouTube Recommendations,YouTube深度推荐系统。Google 2016
- Deep & Cross Network for Ad Click Predictions,DCN,2017
- DeepFM: A Factorization-Machine based Neural Network for CTR Prediction,2017
- Wide & Deep Learning for Recommender Systems,Google 2016
- Learning Deep Structured Semantic Models for Web Search using Clickthrough Data,Microsoft 2013
- AutoInt: Automatic Feature Interaction Learning via Self-Attentive Neural Networks,2018-2019,PKU,自注意力做特征交叉
- BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer,2019,alibaba,基于上下文序列推荐
- Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs,2016,HNSW,ANN最新算法
MovieLens()
MovieLens-1M(ML-1m)
ML-20m
Amazon(Electro)
Criteo
Avazu
KDD12
Beauty
Steam
- LeNet,1998
- AlexNet,2012
- VGGNet,2014
- InceptionNet(GoogLeNet),2014
- ResNet,2015
- Learning Rich Features for Image Manipulation Detection,2018,Adobe,图片PS识别
- Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks,DCGAN,2015