The University of Texas at Austin
Bio
I'm a Ph.D. student in the Department of Electrical and Computer Engineering at the University of
Texas at Austin. I work with Prof. Milos Gligoric. My research area is the fusion of software
engineering and natural language processing, with a focus on improving developers' productivity
during software development, testing, and maintenance. Specific topics include combining machine
learning and code execution for test completion and lemma naming, learning to evolve code and
comments, and frameworks for maintaining executable comments and specifications.
I obtained my Bachelor's degree at the University of Science and Technology of China, majoring in
both Physics and Computer Science.
GitHub (pengyunie)
Google Scholar
DBLP
I will join the Cheriton School of Computer Science at the University of Waterloo as an assistant
professor starting in Fall 2023.
I am looking for self-motivated students with a background in software engineering, programming
languages, formal methods, machine learning, and/or natural language processing.
Publications
19.
Multilingual code co-evolution using large language models.
Jiyang Zhang, Pengyu Nie, Junyi Jessy Li, and Milos Gligoric.
In Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering
(FSE'23), to appear. San Francisco, USA, December 2023.
18.
Extracting inline tests from unit tests.
Yu Liu, Pengyu Nie, Anna Guo, Milos Gligoric, and Owolabi Legunsen.
In International Symposium on Software Testing and Analysis
(ISSTA'23), to appear. Seattle, USA, July 2023.
17.
Learning deep semantics for test completion.
Pengyu Nie, Rahul Banerjee, Junyi Jessy Li, Raymond J. Mooney, and Milos Gligoric.
In International Conference on Software Engineering
(ICSE'23), to appear. Melbourne, Australia, May 2023.
16.
pytest-inline: An inline testing tool for Python.
Yu Liu, Zachary Thurston, Alan Han, Pengyu Nie, Milos Gligoric, and Owolabi Legunsen.
In International Conference on Software Engineering, Tool Demonstrations Track
(ICSEDemo'23), to appear. Melbourne, Australia, May 2023.
15.
More precise regression test selection via reasoning about semantics-modifying changes.
Yu Liu, Jiyang Zhang, Pengyu Nie, Milos Gligoric, and Owolabi Legunsen.
In International Symposium on Software Testing and Analysis
(ISSTA'23), to appear. Seattle, USA, July 2023.
14.
Impact of evaluation methodologies on code summarization.
Pengyu Nie, Jiyang Zhang, Junyi Jessy Li, Raymond J. Mooney, and Milos Gligoric.
In Annual Meeting of the Association for Computational Linguistics
(ACL'22), 4936-4960. Dublin, Ireland, May 2022.
13.
Inline tests.
Yu Liu, Pengyu Nie, Owolabi Legunsen, and Milos Gligoric.
In International Conference on Automated Software Engineering
(ASE'22), 1-13. Oakland Center, Michigan, USA, October 2022.
12.
CoditT5: Pretraining for source code and natural language editing.
Jiyang Zhang, Sheena Panthaplackel, Pengyu Nie, Junyi Jessy Li, and Milos Gligoric.
In International Conference on Automated Software Engineering
(ASE'22), 1-12. Oakland Center, Michigan, USA, October 2022.
11.
Roosterize: Suggesting lemma names for Coq verification projects using deep learning.
Pengyu Nie, Karl Palmskog, Junyi Jessy Li, and Milos Gligoric.
In International Conference on Software Engineering, Tool Demonstrations Track
(ICSEDemo'21), 21-24. Virtual, May 2021.
10.
Leveraging class hierarchy for code comprehension.
Jiyang Zhang, Sheena Panthaplackel, Pengyu Nie, Junyi Jessy Li, Raymond J. Mooney, and Milos Gligoric.
In Workshop on Computer Assisted Programming
(CAP'20). Virtual, December 2020.
9.
Unifying execution of imperative generators and declarative specifications.
[slides] [talk]
Pengyu Nie, Marinela Parovic, Zhiqiang Zang, Sarfraz Khurshid, Aleksandar Milicevic, and Milos Gligoric.
In Conference on Object-Oriented Programming Systems, Languages and Applications
(OOPSLA'20), 217:1-217:26. Chicago, Illinois, USA, November 2020.
8.
On the naturalness of hardware descriptions.
[slides] [talk]
Jaeseong Lee, Pengyu Nie, Junyi Jessy Li, and Milos Gligoric.
In Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering
(FSE'20), 530-542. Sacramento, California, USA, November 2020.
7.
Learning to format Coq code using language models.
[slides]
Pengyu Nie, Karl Palmskog, Junyi Jessy Li, and Milos Gligoric.
In The Coq Workshop
(Coq'20). Paris, France, July 2020.
6.
Debugging the performance of Maven's test isolation: Experience report.
[slides]
Pengyu Nie, Ahmet Celik, Matthew Coley, Aleksandar Milicevic, Jonathan Bell, and Milos Gligoric.
In International Symposium on Software Testing and Analysis
(ISSTA'20), 249-259. Los Angeles, California, USA, July 2020.
5.
Learning to update natural language comments based on code changes.
Sheena Panthaplackel, Pengyu Nie, Milos Gligoric, Junyi Jessy Li, and Raymond J. Mooney.
In Annual Meeting of the Association for Computational Linguistics
(ACL'20), 1853-1868. Seattle, Washington, USA, July 2020.
4.
Deep generation of Coq lemma names using elaborated terms.
[slides] [talk]
Pengyu Nie, Karl Palmskog, Junyi Jessy Li, and Milos Gligoric.
In International Joint Conference on Automated Reasoning
(IJCAR'20), 97-118. Paris, France, June 2020.
3.
Design, implementation, and application of GPU-based Java bytecode interpreters.
Ahmet Celik, Pengyu Nie, Christopher J. Rossbach, and Milos Gligoric.
In Conference on Object-Oriented Programming Systems, Languages and Applications
(OOPSLA'19), 177:1-177:28. Athens, Greece, October 2019.
2.
A framework for writing trigger-action todo comments in executable format.
[slides]
Pengyu Nie, Rishabh Rai, Junyi Jessy Li, Sarfraz Khurshid, Raymond J. Mooney, and Milos Gligoric.
In Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering
(FSE'19), 385-396. Tallinn, Estonia, August 2019.
This paper won an ACM SIGSOFT Distinguished Paper Award.
1.
Natural language processing and program analysis for supporting todo comments as software evolves.
Pengyu Nie, Junyi Jessy Li, Sarfraz Khurshid, Raymond J. Mooney, and Milos Gligoric.
In Workshop on Natural Language Processing for Software Engineering
(NL4SE'18), 775-778. New Orleans, Louisiana, USA, February 2018. Long presentation.
Service
2023: Reviewer for TOSEM, JSS. Sub-reviewer for ICSE, EAAI.
2022: Reviewer for TSE, TOPLAS. Sub-reviewer for ICSE.
2021: PC member of AAAI, NLP4Prog, AIST. Sub-reviewer for ASE, TSE.
2020: Sub-reviewer for ICSE, ISSTA, COLING, ISSRE.
2019: Sub-reviewer for ICSE, ISSTA, IJCAI.
2018: Sub-reviewer for ASE, FSE.
2018-2022: Co-organizer of the NLP+Programming Reading Group at UT Austin.
2022: Co-organizer of the Joint UT-Cornell Software Engineering Seminar.
2022: Committee member of Graduate and Industry Networking (GAIN) at UT Austin.
2022: Mentor of the ECE Partner Program at UT Austin.