Yoshua Bengio
Yoshua Bengio
Professor of computer science, University of Montreal, Mila, IVADO, CIFAR
Verified email at umontreal.ca - Homepage
Title
Cited by
Cited by
Year
Deep learning
Y LeCun, Y Bengio, G Hinton
nature 521 (7553), 436-444, 2015
428352015
Gradient-based learning applied to document recognition
Y LeCun, L Bottou, Y Bengio, P Haffner
Proceedings of the IEEE 86 (11), 2278-2324, 1998
395741998
Generative adversarial nets
I Goodfellow, J Pouget-Abadie, M Mirza, B Xu, D Warde-Farley, S Ozair, ...
Advances in neural information processing systems 27, 2014
351082014
Deep learning
I Goodfellow, Y Bengio, A Courville
MIT press, 2016
310612016
Neural machine translation by jointly learning to align and translate
D Bahdanau, K Cho, Y Bengio
arXiv preprint arXiv:1409.0473, 2014
197932014
Learning phrase representations using RNN encoder-decoder for statistical machine translation
K Cho, B Van Merriënboer, C Gulcehre, D Bahdanau, F Bougares, ...
arXiv preprint arXiv:1406.1078, 2014
154932014
Understanding the difficulty of training deep feedforward neural networks
X Glorot, Y Bengio
Proceedings of the thirteenth international conference on artificial …, 2010
136792010
Learning deep architectures for AI
Y Bengio
Now Publishers Inc, 2009
99112009
Representation learning: A review and new perspectives
Y Bengio, A Courville, P Vincent
IEEE transactions on pattern analysis and machine intelligence 35 (8), 1798-1828, 2013
96172013
A Neural probabilistic language model
Y Bengio, R Ducharme, P Vincent
Journal of Machine Learning Research 3, 1137-1155, 2003
77422003
Show, attend and tell: Neural image caption generation with visual attention
K Xu, J Ba, R Kiros, K Cho, A Courville, R Salakhudinov, R Zemel, ...
International conference on machine learning, 2048-2057, 2015
76512015
Empirical evaluation of gated recurrent neural networks on sequence modeling
J Chung, C Gulcehre, KH Cho, Y Bengio
arXiv preprint arXiv:1412.3555, 2014
74682014
Deep sparse rectifier neural networks
X Glorot, A Bordes, Y Bengio
Proceedings of the fourteenth international conference on artificial …, 2011
74012011
Learning long-term dependencies with gradient descent is difficult
Y Bengio, P Simard, P Frasconi
IEEE transactions on neural networks 5 (2), 157-166, 1994
72381994
Random search for hyper-parameter optimization.
J Bergstra, Y Bengio
Journal of machine learning research 13 (2), 2012
64222012
Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion.
P Vincent, H Larochelle, I Lajoie, Y Bengio, PA Manzagol, L Bottou
Journal of machine learning research 11 (12), 2010
63682010
How transferable are features in deep neural networks?
J Yosinski, J Clune, Y Bengio, H Lipson
arXiv preprint arXiv:1411.1792, 2014
62842014
Extracting and composing robust features with denoising autoencoders
P Vincent, H Larochelle, Y Bengio, PA Manzagol
Proceedings of the 25th international conference on Machine learning, 1096-1103, 2008
59482008
Greedy layer-wise training of deep networks
Y Bengio, P Lamblin, D Popovici, H Larochelle
Advances in neural information processing systems, 153-160, 2007
56672007
Graph attention networks
P Velièkoviæ, G Cucurull, A Casanova, A Romero, P Lio, Y Bengio
arXiv preprint arXiv:1710.10903, 2017
49412017
The system can't perform the operation now. Try again later.
Articles 1–20