GCDT: A Global Context Enhanced Deep Transition Architecture For Sequence Labeling Y Liu, F Meng, J Zhang, J Xu, Y Chen, J Zhou Proceedings of ACL 2019, 2019 | 104 | 2019 |
CM-Net: A Novel Collaborative Memory Network For Spoken Language Understanding Y Liu, F Meng, J Zhang, J Zhou, Y Chen, J Xu Proceedings of EMNLP 2019, 2019 | 75 | 2019 |
Prevent the language model from being overconfident in neural machine translation M Miao, F Meng, Y Liu, XH Zhou, J Zhou arXiv preprint arXiv:2105.11098, 2021 | 33 | 2021 |
Faster Depth-Adaptive Transformers Y Liu, F Meng, J Zhou, Y Chen, J Xu Proceedings of AAAI 2021, 2020 | 32 | 2020 |
WeChat Neural Machine Translation Systems for WMT20 F Meng, J Yan, Y Liu, Y Gao, X Zeng, Q Zeng, P Li, M Chen, J Zhou, S Liu, ... Fifth Conference on Machine Translation (WMT20), 2020 | 21 | 2020 |
WeChat neural machine translation systems for WMT21 X Zeng, Y Liu, E Li, Q Ran, F Meng, P Li, J Xu, J Zhou arXiv preprint arXiv:2108.02401, 2021 | 17 | 2021 |
Scheduled sampling based on decoding steps for neural machine translation Y Liu, F Meng, Y Chen, J Xu, J Zhou arXiv preprint arXiv:2108.12963, 2021 | 16 | 2021 |
Confidence-aware scheduled sampling for neural machine translation Y Liu, F Meng, Y Chen, J Xu, J Zhou arXiv preprint arXiv:2107.10427, 2021 | 13 | 2021 |
Improving translation faithfulness of large language models via augmenting instructions Y Chen, Y Liu, F Meng, Y Chen, J Xu, J Zhou arXiv preprint arXiv:2308.12674, 2023 | 11 | 2023 |
Bilingual mutual information based adaptive training for neural machine translation Y Xu, Y Liu, F Meng, J Zhang, J Xu, J Zhou arXiv preprint arXiv:2105.12523, 2021 | 11 | 2021 |
Conditional bilingual mutual information based adaptive training for neural machine translation S Zhang, Y Liu, F Meng, Y Chen, J Xu, J Liu, J Zhou arXiv preprint arXiv:2203.02951, 2022 | 10 | 2022 |
Depth-adaptive graph recurrent network for text classification Y Liu, F Meng, Y Chen, J Xu, J Zhou arXiv preprint arXiv:2003.00166, 2020 | 4 | 2020 |
Instruction Position Matters in Sequence Generation with Large Language Models Y Liu, X Zeng, F Meng, J Zhou arXiv preprint arXiv:2308.12097, 2023 | 3 | 2023 |
Towards robust online dialogue response generation L Cui, F Meng, Y Liu, J Zhou, Y Zhang arXiv preprint arXiv:2203.03168, 2022 | 1 | 2022 |
Towards Multiple References Era--Addressing Data Leakage and Limited Reference Diversity in NLG Evaluation X Zeng, Y Liu, F Meng, J Zhou arXiv preprint arXiv:2308.03131, 2023 | | 2023 |
BranchNorm: Robustly Scaling Extremely Deep Transformers Y Liu, X Zeng, F Meng, J Zhou arXiv preprint arXiv:2305.02790, 2023 | | 2023 |