Join the Fall 2022 Capstone presentation event to explore new research projects from M.S. in Data Science students.

The Capstone course provides a unique opportunity for students in the M.S. in Data Science program to apply their knowledge of the foundations, theory and methods of data science to address data driven problems in industry, government and the non-profit sector.

Course activities focus on a semester-length project sponsored by a local organization. The resulting projects synthesize the statistical, computational, engineering and social challenges involved in solving complex real-world problems.

The Fall 2022 Capstone course reflects enormous interest in data science, with 43 teams exhibiting at the presentation event. Join to explore the projects and meet with the participating students and mentors. Find project themes and companies below. 


Event Date & Time

Tuesday, December (2:00 PM – 5:00 PM ET) – IN-PERSON ONLY

Location: Pulitzer Hall, Columbia Graduate School of Journalism (Joseph D. Jamail Lecture Hall) 2950 Broadway New York, NY 10027


Agenda

2:00 PM: Join the event. Poster Presentations will be on view until 5:00 PM ET; guests are welcome to float in and out of the event to speak with the students and learn about their projects.

Research will be organized into several categories:

  • Natural Language Processing (NLP)
  • Geospatial, Time Series
  • General Machine Learning (ML)
  • Computer Vision, Deep Learning

Food and beverages will be served throughout the event!

2:30 PMIntroductions from Capstone faculty. Learn more about the Capstone program and its impact across the Data Science Institute and Columbia University at large.

5:00 PM: Event ends.


List of Exhibiting Projects

Floor Plan


Natural Language Processing (NLP)

P01: Entity Resolution and Data Analysis of Author Contribution Statements

  • Elsevier
  • Mentor: Anita de Waard
  • Faculty: Adam Kelleher
  • Students: Chuyang Xiao, Jingwen Bai, Chenxi Jiang, Yunxiao Wang, Xinyu Huang

P02: Identification of Replication “Citances”

  • Elsevier
  • Mentor: Anita de Waard
  • Faculty: Adam Kelleher
  • Students: Tengteng Tao, Zhengyi Fang, Candong Chen, Wenbo Zhao, Jinyu Wang

P03: Regulatory Requirements and Policy Standards and (Large-Language-Model) Benchmarking

  • Johnson & Johnson
  • Mentor: Michael Wiederspiel
  • Faculty: Adam Kelleher
  • Students: Daoxing Zhang, Yihao Gao, Siwen Xie, Siqi He, Vishu Tyagi

P04: Peace Speech Project

  • Earth Institute | Lamont-Doherty Earth Observatory
  • Mentor: Peter Coleman
  • Faculty: Vivian Zhang
  • Students: Hongou Liu, Yuwen Zhang, Yibo Chen, Pinyi Yang, Xinfu Su, Ziheng Ru

P05: Knowledge Graph on Unstructured Data using Unsupervised Approach for Finance Domain with Natural Language Search Enablement

  • Accenture
  • Mentor: Satish Banka
  • Faculty: Sining Chen
  • Students: Arnav Saxena, Ridwan Olawin, Elin Kim, Shashwat Singh, Alex Kita

P06: Detection of Trust in Call Center Interactions

  • Accenture
  • Mentor: Ivan Wong
  • Faculty: Sining Chen
  • Students: Anbang Wang, Binghong Yu, Huaizhi Ge, Keyi Guo, Huanyu Jiang

P07: Hierarchical Topic Modeling over Financial Documents

  • JP Morgan & Chase
  • Mentor: Simerjot Kaur
  • Faculty: Sining Chen
  • Students: Xinyu Wang, Yunchen Yao, Abel Perez-Vargas, Gilberto Garcia Perez, Pablo Ulises Hernandez Garces, Nicolo Ricca

P08: Supervised Learning Methods for Natural Language Processing

  • JP Morgan & Chase
  • Mentor: Akshat Gupta
  • Faculty: Sining Chen
  • Students: Zhirui Yang, Zonghan Yue, Zheng Wu, Qiran Li, Xianmeng Wang, Chenqi Wang

P09: Fine-Tuned Relationship Extraction for Consumer Goods Concepts (1)

  • Unilever
  • Mentor: John Labarga
  • Faculty: Sining Chen
  • Students: Wenxin Zhang, Wei Luo, Xingyu Lu, Jiazhen Li, Yinghao Li

P10: Fine-Tuned Relationship Extraction for Consumer Goods Concepts (2)

  • Unilever
  • Mentor: John Labarga
  • Faculty: Sining Chen
  • Students: Zhiqing Yang, Zhifeng Zhang, Zhucheng Zhan, Jessie Wang, Ruilin Liu

Geospatial, Time Series

P11: A Data-Driven Analysis of Socio-Economic Factors that impact Enrolment in Clinical Trials

  • Johnson & Johnson
  • Mentor: Lars Hulstaert
  • Faculty: Adam Kelleher
  • Students: Yunhan Jin, Sandy Chen, Yixuan Liu, Xingyu Wei, Zeyu Jin

P12: Prediction of Commercial Insurance Payments for Surgical Procedure using Machine Learning

  • Johnson & Johnson
  • Mentor: Cindy Tong
  • Faculty: Adam Kelleher
  • Students: Mahesh Jindal, Rahulraj Singh, Ayush Baral, Parth Gupta, Prerit Jain

P13: Prediction of Commercial Insurance Payments for Surgical Procedure using DataRobot

  • Johnson & Johnson
  • Mentor: Cindy Tong
  • Faculty: Adam Kelleher
  • Students: Sarthak Arora, Parv Joshi, Shruti Kaushal, Ryan Joseph Rogers, Tyler Marshall

P14: Placement Optimization of EV Chargers in the US

  • KPMG
  • Mentor: Chengwei Wang
  • Faculty: Adam Kelleher
  • Students: Yue Zhang, Anne Lin, Mengchen Xu, Clarissa Ruo-Ju Tai, Yu-Chieh Chen

P15: Price Optimization in Pharma through Discount Allocation via Machine Learning

  • Novartis
  • Mentor: Laura Sanchez Garcia
  • Faculty: Adam Kelleher
  • Students: Victoria Edwards, Soham Joshi, Srividya Inampudi, Vedant Rajeev Kumar, Sai Krupa Jangala
  • Time Series

P16: Extending Satellite Observations to Ocean Depths with Machine Learning

  • Earth Institute | Lamont-Doherty Earth Observatory
  • Mentor: Nicholas Bock
  • Faculty: Vivian Zhang
  • Students: Elijah Flomen, Gabrielle Nyirjesy, Blake David Hartung, Yo Xing Jeremijenko-Conley, Erin Josephine Donnelly

P17: Measurements on Greenland Surface Mass Loss with Predictions on Albedo via Machine Learning

  • Earth Institute | Lamont-Doherty Earth Observatory
  • Mentor: Marco Tedesco
  • Faculty: Vivian Zhang
  • Students: Jiawen Zhou, Ke Li, Mingyue Xu, Meggie Wen, Yuezhu Xu, Kailande Cassamajor

P18: Are Government Broadband Internet Subsidies a Waste of Money?

  • School of Engineering and Applied Science
  • Mentor: Henning Schulzrinne
  • Faculty: Vivian Zhang
  • Students: Zheyu Shen, Sitong Qian, Yifan Jiang, Yihan Wang, Shengyuan Cao, Shiyu Wang

P19: Evaluating the Attractiveness of a Country for Business Investment using World Bank Indicators

  • Accenture
  • Mentor: Paritosh Pramanik
  • Faculty: Sining Chen
  • Students: Freddy Wong, Yuan Heng, Hanlin Yan, Jace Yang, Di Mu

P20: Time Series Financial Forecasting

  • JP Morgan & Chase
  • Mentor: Simran Lamba
  • Faculty: Sining Chen
  • Students: Zhenyu Yuan, Kechengjie Zhu, Xuchen Wang, Yao Xiao, Zixiang Yin

P21: Improving the Sales Forecasting Process by Modeling the Lifecycle Events of a Drug

  • Novartis
  • Mentor: Eric Matamoros
  • Faculty: Sining Chen
  • Students: Senqi Zhang, Zehui Wu, Hang Xu, Yajie Zhang, Shuyue Xu

P22: Renewable Energy Growth Challenge

  • Accenture
  • Mentor: Bhushan Jagyasi
  • Faculty: Adam Kelleher
  • Students: Zhining Qiu, Yujia Xie, Weisheng Chen, Hongtao Jiang, Yunzhe Zhang

General Machine Learning (ML)

P23: Staying Ahead of Renewable Energy Curve, Analysis on Reusable Blades

  • NYC Matthews
  • Mentor: Terri Matthews
  • Faculty: Adam Kelleher
  • Students: Tracy Wang, Jiayuan Cui, Vipul Harashawaradhana Harihar, Sarosh Sopariwalla, Sharmi Mathur

P24: RalphLauren.com Website Search – Keyword optimization

  • Ralph Lauren
  • Mentor: Kanika Aggarwal
  • Faculty: Adam Kelleher
  • Students: Nitya Krishna Kumar, Anna Joen, Suvansh Dutta, Abhimanyu Swaroop, Ling Sun

P25: Patent Data and the Evolution of Location

  • Columbia Business School
  • Mentor: Jorge Guzman
  • Faculty: Vivian Zhang
  • Students: Shreya Verma, Arunit Maity, Mehrab Singh Gill, Sarthak Bhargava, Sanjeev Tewani, Malaika Gupta

P26: Galaxy-by-Galaxy Emulation of Cosmo-Hydrodynamical Simulations of Galaxy Formation

  • Graduate School of Arts & Sciences
  • Mentor: Shy Genel
  • Faculty: Vivian Zhang
  • Students: Junsheng Shi, Sicheng Li, Chen Jin, Wen Zhan, Shangzhi Liu

P27: MUTABLE

  • Graduate School of Arts & Sciences
  • Mentor: Yufeng Shen
  • Faculty: Vivian Zhang
  • Students: Yi Duan, Yiquan Li, Junyi Yao, Zhe Hou, Zining Chen

P28: Machine Learning in Rehabilitation Robotics

  • School of Engineering and Applied Science
  • Mentor: Sunil Agrawal
  • Faculty: Vivian Zhang
  • Students: Lea Esther, Yuren, Siyue, Tianhang, Yisi

P29: Fault Detection and Prognosis in Astronomical Observatory Operational Data in Chile

  • School of Engineering and Applied Science
  • Mentor: Vineet Goyal
  • Faculty: Vivian Zhang
  • Students: Jiang Zhu, Siqin Shen, Junhao Zhang, Yuning Ding, Yanyun Chen

P30: AI and Machine Learning Project Exploring the Clinical-Genomic Correlation of Cutaneous T-Cell Lymphoma (CTCL)

  • School of Engineering and Applied Science
  • Mentor: Itsik Pe’er
  • Faculty: Vivian Zhang
  • Students: Haoyang Shen, Lewis Wu, Sung Jun Won, George Bingham Reynolds, Adrian Garcia Hernandez

P31: Early Detection of Endometriosis from Electronic Health Record Data and Claims Data

  • Vagelos
  • Mentor: Noemie Elhadad
  • Faculty: Vivian Zhang
  • Students: Dani Masti

P32: Data Analysis of Single Cell RNA Sequencing for Neuropsychiatric Disorders

  • Vagelos
  • Mentor: Bin Xu
  • Faculty: Vivian Zhang
  • Students: Darvesh Gorhe, Katharina Fijan, Jeon Ju Hyun

P33: Accelerating Drug Discovery through Active Learning-Enhanced Virtual Screening

  • National Institutes of Health
  • Mentor: Pinyi Lu
  • Faculty: Sining Chen
  • Students: Shuqing Shan, Shanzhao Qiao, Yan Gong, Zixiang Tang, Siyu Li

Computer Vision, Deep Learning

P34: Automatic Landcover Change Detection and Classification from Satellite Images

  • JP Morgan & Chase
  • Mentor: Saba Rahimi
  • Faculty: Adam Kelleher
  • Students: Gangadhara Reddy Velagala, Vishwas Reddy Thuniki, Sai Prashanth Pathi, Nishi Amish Modi, Binny Naik
  • Computer Vision

P35: Land Cover Change Detection using Neural Network for Satellite Images

  • JP Morgan & Chase
  • Mentor: Saba Rahimi
  • Faculty: Adam Kelleher
  • Students: Ashkan Bozorgzad, Karveandhan Palanisamy, Hari Prasad Renganathan, Yuki Ikeda, Masataka Koga, Yewen Zhou
  • Computer Vision

P36: Capturing Pavement Markings using Machine Learning Algorithms

  • NYC Romano
  • Mentor: Maddalena Romano
  • Faculty: Adam Kelleher
  • Students: Moya Zhu, Megan Zhou, Ran Pan, Jingfei Fang, Zihao Zhang
  • Computer Vision

P37: Radiology Report Generation Using a Multi-Modal Prototype Network

  • Accenture
  • Mentor: Hemant Palivela
  • Faculty: Sining Chen
  • Students: Ayush Sinha, Amrutha Varshin Sundar, Navjot Singh, Vijay S Kalmath, Andrew Christopher Schaefer, Kiranmai Vasireddy
  • Deep Learning

P38: Improving Speech Transcription Accuracy by Decoding Audio with Language Model on Wav2Vec2.0 Framework

  • Accenture
  • Mentor: Bhushan Jagyasi
  • Faculty: Sining Chen
  • Students: Anh-Vu Nguyen, Sivan Ding, Maxwell Zhou, Alexandria Guo, Julia Wang Antonin Vidon
  • Deep Learning

P39: Using Remote-Sensing Data to Understand Characteristics of Vegetation, such as Species, Health, have several industry applications (1)

  • IBM
  • Mentor: Harini Srinivasan
  • Faculty: Sining Chen
  • Students: Yunze Pan, Ziyan Liu, Anshuo Wu, Shengdi Chen, Yunshu Cai
  • Computer Vision

P40: Using Remote-Sensing Data to Understand Characteristics of Vegetation, such as Species, Health, have several industry applications (2)

  • IBM
  • Mentor: Harini Srinivasan
  • Faculty: Sining Chen
  • Students: Zehua Zeng, Tianhao Wu, Juncheng Pan, Zezhong Fan, Yanbing Chen
  • Computer Vision

P41: One-Shot Learning for Face Recognition

  • JP Morgan & Chase
  • Mentor: Zhen Zeng; Kassiani Papasotiriou
  • Faculty: Sining Chen
  • Students: Pranav Gopal, Aniket Ashutosh Shahane, Vikram Singh, Jannik Jerrit Wiedenhaupt
  • Computer Vision

P42: Creating Multilingual Speech Emotion Recognition Systems JPMorgan (1)

  • JP Morgan & Chase
  • Mentor: Akshat Gupta
  • Faculty: Sining Chen
  • Students: Siddhant Pravin Mahurkar, Siddhant Rajeev Kumar, Angad Nandwani, Mridul Gupta, Eubin Park
  • Deep Learning

P43: Creating Multilingual Speech Emotion Recognition Systems JPMorgan (2)

  • JP Morgan & Chase
  • Mentor: Akshat Gupta
  • Faculty: Sining Chen
  • Students: Jingxiang Zhang, Yuxin Cui, Luwei Zhang, Shirley Gui, Ruoxi Liu, Wael Boukhobza
  • Deep Learning

Capstone Faculty

Sining Chen, Adjunct Professor of Industrial Engineering and Operations Research, Columbia University

Adam S. Kelleher, Adjunct Assistant Professor of Computer Science, Columbia University

Yuan (Vivian) Zhang, Department of Biostatistics, Columbia University