Join the Fall 2022 Capstone presentation event to explore new research projects from M.S. in Data Science students.

The Capstone course provides a unique opportunity for students in the M.S. in Data Science program to apply their knowledge of the foundations, theory and methods of data science to address data-driven problems in industry, government and the non-profit sector.

Course activities focus on a semester-length project sponsored by a local organization. The resulting projects synthesize the statistical, computational, engineering and social challenges involved in solving complex real-world problems.

The Fall 2022 Capstone course reflects enormous interest in data science, with 43 teams exhibiting at the presentation event. Join to explore the projects and meet the participating students and mentors. Project themes and sponsoring organizations are listed below.


Event Date & Time

Tuesday, December (2:00 PM – 5:00 PM ET) – IN-PERSON ONLY

Location: Pulitzer Hall, Columbia Graduate School of Journalism (Joseph D. Jamail Lecture Hall), 2950 Broadway, New York, NY 10027


Agenda

2:00 PM: Join the event. Poster presentations will be on view until 5:00 PM ET; guests are welcome to come and go throughout the event to speak with the students and learn about their projects.

Research will be organized into several categories:

  • Natural Language Processing (NLP)
  • Geospatial, Time Series
  • General Machine Learning (ML)
  • Computer Vision, Deep Learning

Food and beverages will be served throughout the event!

2:30 PM: Introductions from Capstone faculty. Learn more about the Capstone program and its impact across the Data Science Institute and Columbia University at large.

5:00 PM: Event ends.


List of Exhibiting Projects

Floor Plan


Natural Language Processing (NLP)

P01: Entity Resolution and Data Analysis of Author Contribution Statements

  • Elsevier
  • Mentor: Anita de Waard
  • Faculty: Adam Kelleher
  • Students: Chuyang Xiao, Jingwen Bai, Chenxi Jiang, Yunxiao Wang, Xinyu Huang

View Poster

P02: Identification of Replication “Citances”

  • Elsevier
  • Mentor: Anita de Waard
  • Faculty: Adam Kelleher
  • Students: Tengteng Tao, Zhengyi Fang, Candong Chen, Wenbo Zhao, Jinyu Wang

View Poster

P03: Regulatory Requirements and Policy Standards and (Large-Language-Model) Benchmarking

  • Johnson & Johnson
  • Mentor: Michael Wiederspiel
  • Faculty: Adam Kelleher
  • Students: Daoxing Zhang, Yihao Gao, Siwen Xie, Siqi He, Vishu Tyagi

View Poster

P04: Peace Speech Project

  • Earth Institute | Lamont-Doherty Earth Observatory
  • Mentor: Peter Coleman
  • Faculty: Vivian Zhang
  • Students: Hongou Liu, Yuwen Zhang, Yibo Chen, Pinyi Yang, Xinfu Su, Ziheng Ru

View Poster

P05: Knowledge Graph on Unstructured Data using Unsupervised Approach for Finance Domain with Natural Language Search Enablement

  • Accenture
  • Mentor: Satish Banka
  • Faculty: Sining Chen
  • Students: Arnav Saxena, Ridwan Olawin, Elin Kim, Shashwat Singh, Alex Kita

View Poster

P06: Detection of Trust in Call Center Interactions

  • Accenture
  • Mentor: Ivan Wong
  • Faculty: Sining Chen
  • Students: Anbang Wang, Binghong Yu, Huaizhi Ge, Keyi Guo, Huanyu Jiang

View Poster

P07: Hierarchical Topic Modeling over Financial Documents

  • JP Morgan & Chase
  • Mentor: Simerjot Kaur
  • Faculty: Sining Chen
  • Students: Xinyu Wang, Yunchen Yao, Abel Perez-Vargas, Gilberto Garcia Perez, Pablo Ulises Hernandez Garces, Nicolo Ricca

View Poster

P08: Supervised Learning Methods for Natural Language Processing

  • JP Morgan & Chase
  • Mentor: Akshat Gupta
  • Faculty: Sining Chen
  • Students: Zhirui Yang, Zonghan Yue, Zheng Wu, Qiran Li, Xianmeng Wang, Chenqi Wang

View Poster

P09: Fine-Tuned Relationship Extraction for Consumer Goods Concepts (1)

  • Unilever
  • Mentor: John Labarga
  • Faculty: Sining Chen
  • Students: Wenxin Zhang, Wei Luo, Xingyu Lu, Jiazhen Li, Yinghao Li

View Poster

P10: Fine-Tuned Relationship Extraction for Consumer Goods Concepts (2)

  • Unilever
  • Mentor: John Labarga
  • Faculty: Sining Chen
  • Students: Zhiqing Yang, Zhifeng Zhang, Zhucheng Zhan, Jessie Wang, Ruilin Liu

View Poster


Geospatial, Time Series

P11: A Data-Driven Analysis of Socio-Economic Factors that impact Enrolment in Clinical Trials

  • Johnson & Johnson
  • Mentor: Lars Hulstaert
  • Faculty: Adam Kelleher
  • Students: Yunhan Jin, Sandy Chen, Yixuan Liu, Xingyu Wei, Zeyu Jin

View Poster

P12: Prediction of Commercial Insurance Payments for Surgical Procedure using Machine Learning

  • Johnson & Johnson
  • Mentor: Cindy Tong
  • Faculty: Adam Kelleher
  • Students: Mahesh Jindal, Rahulraj Singh, Ayush Baral, Parth Gupta, Prerit Jain

View Poster

P13: Prediction of Commercial Insurance Payments for Surgical Procedure using DataRobot

  • Johnson & Johnson
  • Mentor: Cindy Tong
  • Faculty: Adam Kelleher
  • Students: Sarthak Arora, Parv Joshi, Shruti Kaushal, Ryan Joseph Rogers, Tyler Marshall

View Poster

P14: Placement Optimization of EV Chargers in the US

  • KPMG
  • Mentor: Chengwei Wang
  • Faculty: Adam Kelleher
  • Students: Yue Zhang, Anne Lin, Mengchen Xu, Clarissa Ruo-Ju Tai, Yu-Chieh Chen

View Poster

P15: Price Optimization in Pharma through Discount Allocation via Machine Learning

  • Novartis
  • Mentor: Laura Sanchez Garcia
  • Faculty: Adam Kelleher
  • Students: Victoria Edwards, Soham Joshi, Srividya Inampudi, Vedant Rajeev Kumar, Sai Krupa Jangala
  • Time Series

View Poster

P16: Extending Satellite Observations to Ocean Depths with Machine Learning

  • Earth Institute | Lamont-Doherty Earth Observatory
  • Mentor: Nicholas Bock
  • Faculty: Vivian Zhang
  • Students: Elijah Flomen, Gabrielle Nyirjesy, Blake David Hartung, Yo Xing Jeremijenko-Conley, Erin Josephine Donnelly

View Poster

P17: Measurements on Greenland Surface Mass Loss with Predictions on Albedo via Machine Learning

  • Earth Institute | Lamont-Doherty Earth Observatory
  • Mentor: Marco Tedesco
  • Faculty: Vivian Zhang
  • Students: Jiawen Zhou, Ke Li, Mingyue Xu, Meggie Wen, Yuezhu Xu, Kailande Cassamajor

View Poster

P18: Are Government Broadband Internet Subsidies a Waste of Money?

  • School of Engineering and Applied Science
  • Mentor: Henning Schulzrinne
  • Faculty: Vivian Zhang
  • Students: Zheyu Shen, Sitong Qian, Yifan Jiang, Yihan Wang, Shengyuan Cao, Shiyu Wang

View Poster

P19: Evaluating the Attractiveness of a Country for Business Investment using World Bank Indicators

  • Accenture
  • Mentor: Paritosh Pramanik
  • Faculty: Sining Chen
  • Students: Freddy Wong, Yuan Heng, Hanlin Yan, Jace Yang, Di Mu

View Poster

P20: Time Series Financial Forecasting

  • JP Morgan & Chase
  • Mentor: Simran Lamba
  • Faculty: Sining Chen
  • Students: Zhenyu Yuan, Kechengjie Zhu, Xuchen Wang, Yao Xiao, Zixiang Yin

View Poster

P21: Improving the Sales Forecasting Process by Modeling the Lifecycle Events of a Drug

  • Novartis
  • Mentor: Eric Matamoros
  • Faculty: Sining Chen
  • Students: Senqi Zhang, Zehui Wu, Hang Xu, Yajie Zhang, Shuyue Xu

View Poster

P22: Renewable Energy Growth Challenge

  • Accenture
  • Mentor: Bhushan Jagyasi
  • Faculty: Adam Kelleher
  • Students: Zhining Qiu, Yujia Xie, Weisheng Chen, Hongtao Jiang, Yunzhe Zhang

View Poster


General Machine Learning (ML)

P23: Staying Ahead of Renewable Energy Curve, Analysis on Reusable Blades

  • NYC Matthews
  • Mentor: Terri Matthews
  • Faculty: Adam Kelleher
  • Students: Tracy Wang, Jiayuan Cui, Vipul Harashawaradhana Harihar, Sarosh Sopariwalla, Sharmi Mathur

View Poster

P24: RalphLauren.com Website Search – Keyword optimization

  • Ralph Lauren
  • Mentor: Kanika Aggarwal
  • Faculty: Adam Kelleher
  • Students: Nitya Krishna Kumar, Anna Joen, Suvansh Dutta, Abhimanyu Swaroop, Ling Sun

View Poster

P25: Patent Data and the Evolution of Location

  • Columbia Business School
  • Mentor: Jorge Guzman
  • Faculty: Vivian Zhang
  • Students: Shreya Verma, Arunit Maity, Mehrab Singh Gill, Sarthak Bhargava, Sanjeev Tewani, Malaika Gupta

View Poster

P26: Galaxy-by-Galaxy Emulation of Cosmo-Hydrodynamical Simulations of Galaxy Formation

  • Graduate School of Arts & Sciences
  • Mentor: Shy Genel
  • Faculty: Vivian Zhang
  • Students: Junsheng Shi, Sicheng Li, Chen Jin, Wen Zhan, Shangzhi Liu

View Poster

P27: MUTABLE

  • Graduate School of Arts & Sciences
  • Mentor: Yufeng Shen
  • Faculty: Vivian Zhang
  • Students: Yi Duan, Yiquan Li, Junyi Yao, Zhe Hou, Zining Chen

View Poster

P28: Machine Learning in Rehabilitation Robotics

  • School of Engineering and Applied Science
  • Mentor: Sunil Agrawal
  • Faculty: Vivian Zhang
  • Students: Lea Esther, Yuren, Siyue, Tianhang, Yisi

View Poster

P29: Fault Detection and Prognosis in Astronomical Observatory Operational Data in Chile

  • School of Engineering and Applied Science
  • Mentor: Vineet Goyal
  • Faculty: Vivian Zhang
  • Students: Jiang Zhu, Siqin Shen, Junhao Zhang, Yuning Ding, Yanyun Chen

View Poster

P30: AI and Machine Learning Project Exploring the Clinical-Genomic Correlation of Cutaneous T-Cell Lymphoma (CTCL)

  • School of Engineering and Applied Science
  • Mentor: Itsik Pe’er
  • Faculty: Vivian Zhang
  • Students: Haoyang Shen, Lewis Wu, Sung Jun Won, George Bingham Reynolds, Adrian Garcia Hernandez

View Poster

P31: Early Detection of Endometriosis from Electronic Health Record Data and Claims Data

  • Vagelos
  • Mentor: Noemie Elhadad
  • Faculty: Vivian Zhang
  • Students: Dani Masti

View Poster

P32: Data Analysis of Single Cell RNA Sequencing for Neuropsychiatric Disorders

  • Vagelos
  • Mentor: Bin Xu
  • Faculty: Vivian Zhang
  • Students: Darvesh Gorhe, Katharina Fijan, Jeon Ju Hyun

View Poster

P33: Accelerating Drug Discovery through Active Learning-Enhanced Virtual Screening

  • National Institutes of Health
  • Mentor: Pinyi Lu
  • Faculty: Sining Chen
  • Students: Shuqing Shan, Shanzhao Qiao, Yan Gong, Zixiang Tang, Siyu Li

View Poster


Computer Vision, Deep Learning

P34: Automatic Landcover Change Detection and Classification from Satellite Images

  • JP Morgan & Chase
  • Mentor: Saba Rahimi
  • Faculty: Adam Kelleher
  • Students: Gangadhara Reddy Velagala, Vishwas Reddy Thuniki, Sai Prashanth Pathi, Nishi Amish Modi, Binny Naik
  • Computer Vision

View Poster

P35: Land Cover Change Detection using Neural Network for Satellite Images

  • JP Morgan & Chase
  • Mentor: Saba Rahimi
  • Faculty: Adam Kelleher
  • Students: Ashkan Bozorgzad, Karveandhan Palanisamy, Hari Prasad Renganathan, Yuki Ikeda, Masataka Koga, Yewen Zhou
  • Computer Vision

View Poster

P36: Capturing Pavement Markings using Machine Learning Algorithms

  • NYC Romano
  • Mentor: Maddalena Romano
  • Faculty: Adam Kelleher
  • Students: Moya Zhu, Megan Zhou, Ran Pan, Jingfei Fang, Zihao Zhang
  • Computer Vision

View Poster

P37: Radiology Report Generation Using a Multi-Modal Prototype Network

  • Accenture
  • Mentor: Hemant Palivela
  • Faculty: Sining Chen
  • Students: Ayush Sinha, Amrutha Varshin Sundar, Navjot Singh, Vijay S Kalmath, Andrew Christopher Schaefer, Kiranmai Vasireddy
  • Deep Learning

View Poster

P38: Improving Speech Transcription Accuracy by Decoding Audio with Language Model on Wav2Vec2.0 Framework

  • Accenture
  • Mentor: Bhushan Jagyasi
  • Faculty: Sining Chen
  • Students: Anh-Vu Nguyen, Sivan Ding, Maxwell Zhou, Alexandria Guo, Julia Wang, Antonin Vidon
  • Deep Learning

View Poster

P39: Using Remote-Sensing Data to Understand Characteristics of Vegetation, such as Species, Health, have several industry applications (1)

  • IBM
  • Mentor: Harini Srinivasan
  • Faculty: Sining Chen
  • Students: Yunze Pan, Ziyan Liu, Anshuo Wu, Shengdi Chen, Yunshu Cai
  • Computer Vision

View Poster

P40: Using Remote-Sensing Data to Understand Characteristics of Vegetation, such as Species, Health, have several industry applications (2)

  • IBM
  • Mentor: Harini Srinivasan
  • Faculty: Sining Chen
  • Students: Zehua Zeng, Tianhao Wu, Juncheng Pan, Zezhong Fan, Yanbing Chen
  • Computer Vision

View Poster

P41: One-Shot Learning for Face Recognition

  • JP Morgan & Chase
  • Mentor: Zhen Zeng; Kassiani Papasotiriou
  • Faculty: Sining Chen
  • Students: Pranav Gopal, Aniket Ashutosh Shahane, Vikram Singh, Jannik Jerrit Wiedenhaupt
  • Computer Vision

View Poster

P42: Creating Multilingual Speech Emotion Recognition Systems JPMorgan (1)

  • JP Morgan & Chase
  • Mentor: Akshat Gupta
  • Faculty: Sining Chen
  • Students: Siddhant Pravin Mahurkar, Siddhant Rajeev Kumar, Angad Nandwani, Mridul Gupta, Eubin Park
  • Deep Learning

View Poster

P43: Creating Multilingual Speech Emotion Recognition Systems JPMorgan (2)

  • JP Morgan & Chase
  • Mentor: Akshat Gupta
  • Faculty: Sining Chen
  • Students: Jingxiang Zhang, Yuxin Cui, Luwei Zhang, Shirley Gui, Ruoxi Liu, Wael Boukhobza
  • Deep Learning

View Poster


Capstone Faculty

Sining Chen, Adjunct Professor of Industrial Engineering and Operations Research, Columbia University

Adam S. Kelleher, Adjunct Assistant Professor of Computer Science, Columbia University

Yuan (Vivian) Zhang, Department of Biostatistics, Columbia University