IreneLi

Hi there, I’m Irene Li (李紫辉)! Welcome to my blog! :)

I want to share my learning journals, notes, and programming exercises with you. The topics include data science, statistics, machine learning, deep learning, AI applications, etc.

I am now an Assistant Professor at the Information Technology Center, University of Tokyo. Please contact me via ireneli[at]ds.itc.u-tokyo.ac.jp or irene.li[at]aya.yale.edu. I am starting my own Li-Lab at Tokyo University, working in AI, ML, and NLP. I am actively looking for research assistants to join our lab for collaboration. Please drop me an email for potential research projects.

Before moving to Japan, I worked in the LILY lab at Yale University. My Ph.D. advisor is Prof. Dragomir Radev. On March 2022, I passed my Ph.D. Defense on the topic of “Neural Graph Transfer Learning in Natural Language Processing Tasks“! 🎉

👧Personal Links

Google Scholar

Twitter

For more recent work about me and my team, please check LiLab Page!

🔬Research Projects

Graph Neural Networks for NLP

LiGCN: Label-interpretable Graph Convolutional Networks for Multi-label Text Classification
Irene Li, Aosong Feng, Tianxiao Li, Hao Wu, Toyotaro Suzumura and Ruihai Dong
DLG4NLP Workshop, NAACL 2022.
Variational Graph Autoencoding as Cheap Supervision for AMR Coreference Resolution
Irene Li, Linfeng Song, Kun Xu and Dong Yu
ACL, 2022.
Efficient Variational Graph Autoencoders for Unsupervised Cross-domain Prerequisite Chains
Irene Li, Vanessa Yan and Dragomir Radev
ENLSP Workshop, NeurIPS 2021.
Unsupervised Cross-Domain Prerequisite Chain Learning using Variational Graph Autoencoders
Irene Li, Vanessa Yan, Tianxiao Li, Rihao Qu and Dragomir Radev
ACL, 2021. Slides. Github.
R-VGAE: Relational-variational Graph Autoencoder for Unsupervised Prerequisite Chain Learning
Irene Li, Alexander Fabbri, Swapnil Hingmire, and Dragomir Radev
COLING, 2020. Slides. Github.
What Should I Learn First: Introducing LectureBank for NLP Education and Prerequisite Chain Learning
Irene Li, Alexander Fabbri, Robert Tung, and Dragomir Radev
AAAI, 2019. Dataset

Transfer Learning, Attention Models

Surfer100: Generating Surveys From Web Resources, Wikipedia-style
Irene Li, Alexander Fabbri, Rina Kawamura, Yixin Liu, Xiangru Tang, Jaesung Tae, Chang Shen, Sally Ma, Tomoe Mizutani, Dragomir Radev
LREC, 2022
Improving Cross-lingual Text Classification with Zero-shot Instance-Weighting
Irene Li, Prithviraj Sen, Huaiyu Zhu, Yunyao Li and Dragomir Radev
RepL4NLP Workshop, ACL, 2021
A Neural Topic-Attention Model for Medical Term Abbreviation Disambiguation
Irene Li, Michihiro Yasunaga, Muhammed Yavuz Nuzumlalı, Cesar Caraballo, Shiwani Mahajan, Harlan Krumholz, and Dragomir Radev
Machine Learning for Health Workshop, NeurIPS 2019. Code
Detecting Bias in Transfer Learning Approaches for Text Classification
Irene Li
arXiv preprint, 2021
Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model
Alexander Fabbri, Irene Li, Tianwei She, Suyi Li, and Dragomir Radev
ACL 2019. Dataset and Code
TutorialBank: Using a Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation
Alexander Fabbri, Irene Li, Prawat Trairatvorakul, Yijiao He, Weitai Ting, Robert Tung, Caitlin Westerfield, and Dragomir Radev
ACL 2018. Dataset. BLOG.

Surveys

Neural Natural Language Processing for Unstructured Data in Electronic Health Records: a Review
Irene Li, Jessica Pan, Jeremy Goldwasser, Neha Verma, Wai Pan Wong, Muhammed Yavuz Nuzumlalı, Benjamin Rosand, Yixin Li, Matthew Zhang, David Chang, R. Andrew Taylor, Harlan M. Krumholz and Dragomir Radev
Computer Science Review, volume 46, 2022

NLP for EHR, Covid

Global Data Science Project for COVID-19 Summary Report
Dario Garcia-Gasulla, Sergio Alvarez Napagao, Irene Li, Hiroshi Maruyama, Hiroki Kanezashi, Raquel P’erez-Arnal, Kunihiko Miyoshi, Euma Ishii, Keita Suzuki, Sayaka Shiba, Mariko Kurokawa, Yuta Kanzawa, Naomi Nakagawa, Masatoshi Hanai, Yixin Li, Tianxiao Li
arXiv preprint, 2020
What are We Depressed about When We Talk about COVID-19: Mental Health Analysis on Tweets Using Natural Language Processing
Irene Li, Yixin Li, Tianxiao Li, Sergio Alvarez-Napagao, Dario Garcia-Gasulla, Toyotaro Suzumura
Fortieth SGAI International Conference on Artificial Intelligence, 2020. Slides. Github

Semantic Parsing

SParC: Cross-Domain Semantic Parsing in Context
Tao Yu, Rui Zhang, Michihiro Yasunaga, Yi-Chern Tan, Xi Victoria Lin, Suyi Li, Heyang Er, Irene Li, Bo Pang, Tao Chen, Emily Ji, Shreya Dixit, David Proctor, Sungrok Shim, Jonathan Kraft, Vincent Zhang, Caiming Xiong, Richard Socher and Dragomir Radev
ACL 2019
ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks
Michihiro Yasunaga, Jungo Kasai, Rui Zhang, Alexander Fabbri, Irene Li, Dan Friedman, and Dragomir Radev
AAAI 2019
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task
Tao Yu, Rui Zhang, Kai Yang, Michihiro Yasunaga, Dongxu Wang, Zifan Li, James Ma, Irene Li, Qingning Yao, Shanelle Roman, Zilin Zhang, and Dragomir Radev
EMNLP 2018 BLOG

Earlier Research Work (Before my PhD)

Medical Text Classification Using Convolutional Neural Networks (Informatics for Health 2017), internship work at IBM Research

Research of distributed search engine based on Hadoop (DSEH) (Applied Mechanics and Materials 2013), work during Undergraduate

Research and Implementation on a Statistic Algorithm Based on Multidimensional Dynamic Reports (Applied Mechanics and Materials 2013), work during Undergraduate

📄 Recent Posts

📚Other Activities

Book Translation: AI Blueprints: How to build and deploy AI business projects (Origin) in progress, the Chinese version is on sale now on jd.com (2022)
Teaching Fellow for CPSC 477/577 Natural Language Processing at Yale (2020, Spring).
Book Translation: Learning TensorFlow (O’Reilly Media) (Origin) Chinese version is on sale now on jd.com (2018).
Deep Learning Meetup @ Dublin, Ireland: organizer and committee member (2016-2017)