2.4 特征工程【stanford-cs329p】
EMNLP-2020 CodeBERT:A Pre-Trained Model for Programming and Natural Languages
POPL-2019 code2vec:Learning Distributed Representations of Code
2.3 数据变换【stanford-cs329p】
2.2 数据清理【stanford-cs329p】
CVPR-2017 Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Stanford-2022 AI Index Report
OpenAI-2022 Text and Code Embeddings by Contrastive Pre-Training
ACL-2020 Contrastive Code Representation Learning
2.1 探索性数据分析【stanford-cs329p】
Archive
Total 255 articles