Join the Fall 2022 Capstone presentation event to explore new research projects from M.S. in Data Science students.
The Capstone course provides a unique opportunity for students in the M.S. in Data Science program to apply their knowledge of the foundations, theory and methods of data science to address data driven problems in industry, government and the non-profit sector.
Course activities focus on a semester-length project sponsored by a local organization. The resulting projects synthesize the statistical, computational, engineering and social challenges involved in solving complex real-world problems.
The Fall 2022 Capstone course reflects enormous interest in data science, with 43 teams exhibiting at the presentation event. Join to explore the projects and meet with the participating students and mentors. Find project themes and companies below.
Event Date & Time
Tuesday, December 5 (2:00 PM – 5:00 PM ET) – IN-PERSON ONLY
Location: Pulitzer Hall, Columbia Graduate School of Journalism (Joseph D. Jamail Lecture Hall) 2950 Broadway New York, NY 10027
Agenda
2:00 PM: Join the event. Poster Presentations will be on view until 5:00 PM ET; guests are welcome to float in and out of the event to speak with the students and learn about their projects.
Research will be organized into several categories:
Natural Language Processing (NLP)
Geospatial, Time Series
General Machine Learning (ML)
Computer Vision, Deep Learning
Food and beverages will be served throughout the event!
2:30 PM : Introductions from Capstone faculty. Learn more about the Capstone program and its impact across the Data Science Institute and Columbia University at large.
5:00 PM: Event ends.
List of Exhibiting Projects
Floor Plan
Natural Language Processing (NLP)
P01: Entity Resolution and Data Analysis of Author Contribution Statements
Elsevier
Mentor: Anita de Waard
Faculty: Adam Kelleher
Students: Chuyang Xiao, Jingwen Bai, Chenxi Jiang, Yunxiao Wang, Xinyu Huang
View Poster
P02: Identification of Replication “Citances”
Elsevier
Mentor: Anita de Waard
Faculty: Adam Kelleher
Students: Tengteng Tao, Zhengyi Fang, Candong Chen, Wenbo Zhao, Jinyu Wang
View Poster
P03: Regulatory Requirements and Policy Standards and (Large-Language-Model) Benchmarking
Johnson & Johnson
Mentor: Michael Wiederspiel
Faculty: Adam Kelleher
Students: Daoxing Zhang, Yihao Gao, Siwen Xie, Siqi He, Vishu Tyagi
View Po s ter
P04: Peace Speech Project
Earth Institute | Lamont-Doherty Earth Observatory
Mentor: Peter Coleman
Faculty: Vivian Zhang
Students: Hongou Liu, Yuwen Zhang, Yibo Chen, Pinyi Yang, Xinfu Su, Ziheng Ru
View Poster
P05: Knowledge Graph on Unstructured Data using Unsupervised Approach for Finance Domain with Natural Language Search Enablement
Accenture
Mentor: Satish Banka
Faculty: Sining Chen
Students: Arnav Saxena, Ridwan Olawin, Elin Kim, Shashwat Singh, Alex Kita
View Poster
P06: Detection of Trust in Call Center Interactions
Accenture
Mentor: Ivan Wong
Faculty: Sining Chen
Students: Anbang Wang, Binghong Yu, Huaizhi Ge, Keyi Guo, Huanyu Jiang
View Poster
P07: Hierarchical Topic Modeling over Financial Documents
JP Morgan & Chase
Mentor: Simerjot Kaur
Faculty: Sining Chen
Students: Xinyu Wang, Yunchen Yao, Abel Perez-Vargas, Gilberto Garcia Perez, Pablo Ulises Hernandez Garces, Nicolo Ricca
View Poster
P08: Supervised Learning Methods for Natural Language Processing
JP Morgan & Chase
Mentor: Akshat Gupta
Faculty: Sining Chen
Students: Zhirui Yang, Zonghan Yue, Zheng Wu, Qiran Li, Xianmeng Wang, Chenqi Wang
View Poster
P09: Fine-Tuned Relationship Extraction for Consumer Goods Concepts (1)
Unilever
Mentor: John Labarga
Faculty: Sining Chen
Students: Wenxin Zhang, Wei Luo, Xingyu Lu, Jiazhen Li, Yinghao Li
View Poster
P10: Fine-Tuned Relationship Extraction for Consumer Goods Concepts (2)
Unilever
Mentor: John Labarga
Faculty: Sining Chen
Students: Zhiqing Yang, Zhifeng Zhang, Zhucheng Zhan, Jessie Wang, Ruilin Liu
View Poster
Geospatial, Time Series
P11: A Data-Driven Analysis of Socio-Economic Factors that impact Enrolment in Clinical Trials
Johnson & Johnson
Mentor: Lars Hulstaert
Faculty: Adam Kelleher
Students: Yunhan Jin, Sandy Chen, Yixuan Liu, Xingyu Wei, Zeyu Jin
View Poster
P12: Prediction of Commercial Insurance Payments for Surgical Procedure using Machine Learning
Johnson & Johnson
Mentor: Cindy Tong
Faculty: Adam Kelleher
Students: Mahesh Jindal, Rahulraj Singh, Ayush Baral, Parth Gupta, Prerit Jain
View Poster
P13: Prediction of Commercial Insurance Payments for Surgical Procedure using DataRobot
Johnson & Johnson
Mentor: Cindy Tong
Faculty: Adam Kelleher
Students: Sarthak Arora, Parv Joshi, Shruti Kaushal, Ryan Joseph Rogers, Tyler Marshall
View Poster
P14: Placement Optimization of EV Chargers in the US
KPMG
Mentor: Chengwei Wang
Faculty: Adam Kelleher
Students: Yue Zhang, Anne Lin, Mengchen Xu, Clarissa Ruo-Ju Tai, Yu-Chieh Chen
View Poster
P15: Price Optimization in Pharma through Discount Allocation via Machine Learning
Novartis
Mentor: Laura Sanchez Garcia
Faculty: Adam Kelleher
Students: Victoria Edwards, Soham Joshi, Srividya Inampudi, Vedant Rajeev Kumar, Sai Krupa Jangala
Time Series
View Poster
P16: Extending Satellite Observations to Ocean Depths with Machine Learning
Earth Institute | Lamont-Doherty Earth Observatory
Mentor: Nicholas Bock
Faculty: Vivian Zhang
Students: Elijah Flomen, Gabrielle Nyirjesy, Blake David Hartung, Yo Xing Jeremijenko-Conley, Erin Josephine Donnelly
View Poster
P17: Measurements on Greenland Surface Mass Loss with Predictions on Albedo via Machine Learning
Earth Institute | Lamont-Doherty Earth Observatory
Mentor: Marco Tedesco
Faculty: Vivian Zhang
Students: Jiawen Zhou, Ke Li, Mingyue Xu, Meggie Wen, Yuezhu Xu, Kailande Cassamajor
View Poster
P18: Are Government Broadband Internet Subsidies a Waste of Money?
School of Engineering and Applied Science
Mentor: Henning Schulzrinne
Faculty: Vivian Zhang
Students: Zheyu Shen, Sitong Qian, Yifan Jiang, Yihan Wang, Shengyuan Cao, Shiyu Wang
View Poster
P19: Evaluating the Attractiveness of a Country for Business Investment using World Bank Indicators
Accenture
Mentor: Paritosh Pramanik
Faculty: Sining Chen
Students: Freddy Wong, Yuan Heng, Hanlin Yan, Jace Yang, Di Mu
View Poster
P20: Time Series Financial Forecasting
JP Morgan & Chase
Mentor: Simran Lamba
Faculty: Sining Chen
Students: Zhenyu Yuan, Kechengjie Zhu, Xuchen Wang, Yao Xiao, Zixiang Yin
View Poster
P21: Improving the Sales Forecasting Process by Modeling the Lifecycle Events of a Drug
Novartis
Mentor: Eric Matamoros
Faculty: Sining Chen
Students: Senqi Zhang, Zehui Wu, Hang Xu, Yajie Zhang, Shuyue Xu
View Poster
P22: Renewable Energy Growth Challenge
Accenture
Mentor: Bhushan Jagyasi
Faculty: Adam Kelleher
Students: Zhining Qiu, Yujia Xie, Weisheng Chen, Hongtao Jiang, Yunzhe Zhang
View Poster
General Machine Learning (ML)
P23: Staying Ahead of Renewable Energy Curve, Analysis on Reusable Blades
NYC Matthews
Mentor: Terri Matthews
Faculty: Adam Kelleher
Students: Tracy Wang, Jiayuan Cui, Vipul Harashawaradhana Harihar, Sarosh Sopariwalla, Sharmi Mathur
View Poster
P24: RalphLauren.com Website Search – Keyword optimization
Ralph Lauren
Mentor: Kanika Aggarwal
Faculty: Adam Kelleher
Students: Nitya Krishna Kumar, Anna Joen, Suvansh Dutta, Abhimanyu Swaroop, Ling Sun
View Poster
P25: Patent Data and the Evolution of Location
Columbia Business School
Mentor: Jorge Guzman
Faculty: Vivian Zhang
Students: Shreya Verma, Arunit Maity, Mehrab Singh Gill, Sarthak Bhargava, Sanjeev Tewani, Malaika Gupta
View Poster
P26: Galaxy-by-Galaxy Emulation of Cosmo-Hydrodynamical Simulations of Galaxy Formation
Graduate School of Arts & Sciences
Mentor: Shy Genel
Faculty: Vivian Zhang
Students: Junsheng Shi, Sicheng Li, Chen Jin, Wen Zhan, Shangzhi Liu
View Poster
P27: MUTABLE
Graduate School of Arts & Sciences
Mentor: Yufeng Shen
Faculty: Vivian Zhang
Students: Yi Duan, Yiquan Li, Junyi Yao, Zhe Hou, Zining Chen
View Poster
P28: Machine Learning in Rehabilitation Robotics
School of Engineering and Applied Science
Mentor: Sunil Agrawal
Faculty: Vivian Zhang
Students: Lea Esther, Yuren, Siyue, Tianhang, Yisi
View Poster
P29: Fault Detection and Prognosis in Astronomical Observatory Operational Data in Chile
School of Engineering and Applied Science
Mentor: Vineet Goyal
Faculty: Vivian Zhang
Students: Jiang Zhu, Siqin Shen, Junhao Zhang, Yuning Ding, Yanyun Chen
View Poster
P30: AI and Machine Learning Project Exploring the Clinical-Genomic Correlation of Cutaneous T-Cell Lymphoma (CTCL)
School of Engineering and Applied Science
Mentor: Itsik Pe’er
Faculty: Vivian Zhang
Students: Haoyang Shen, Lewis Wu, Sung Jun Won, George Bingham Reynolds, Adrian Garcia Hernandez
View Poster
P31: Early Detection of Endometriosis from Electronic Health Record Data and Claims Data
Vagelos
Mentor: Noemie Elhadad
Faculty: Vivian Zhang
Students: Dani Masti
View Poster
P32: Data Analysis of Single Cell RNA Sequencing for Neuropsychiatric Disorders
Vagelos
Mentor: Bin Xu
Faculty: Vivian Zhang
Students: Darvesh Gorhe, Katharina Fijan, Jeon Ju Hyun
View Poster
P33: Accelerating Drug Discovery through Active Learning-Enhanced Virtual Screening
National Institutes of Health
Mentor: Pinyi Lu
Faculty: Sining Chen
Students: Shuqing Shan, Shanzhao Qiao, Yan Gong, Zixiang Tang, Siyu Li
View Poster
Computer Vision, Deep Learning
P34: Automatic Landcover Change Detection and Classification from Satellite Images
JP Morgan & Chase
Mentor: Saba Rahimi
Faculty: Adam Kelleher
Students: Gangadhara Reddy Velagala, Vishwas Reddy Thuniki, Sai Prashanth Pathi, Nishi Amish Modi, Binny Naik
Computer Vision
View Poster
P35: Land Cover Change Detection using Neural Network for Satellite Images
JP Morgan & Chase
Mentor: Saba Rahimi
Faculty: Adam Kelleher
Students: Ashkan Bozorgzad, Karveandhan Palanisamy, Hari Prasad Renganathan, Yuki Ikeda, Masataka Koga, Yewen Zhou
Computer Vision
View Poster
P36: Capturing Pavement Markings using Machine Learning Algorithms
NYC Romano
Mentor: Maddalena Romano
Faculty: Adam Kelleher
Students: Moya Zhu, Megan Zhou, Ran Pan, Jingfei Fang, Zihao Zhang
Computer Vision
View Poster
P37: Radiology Report Generation Using a Multi-Modal Prototype Network
Accenture
Mentor: Hemant Palivela
Faculty: Sining Chen
Students: Ayush Sinha, Amrutha Varshin Sundar, Navjot Singh, Vijay S Kalmath, Andrew Christopher Schaefer, Kiranmai Vasireddy
Deep Learning
View Poster
P38: Improving Speech Transcription Accuracy by Decoding Audio with Language Model on Wav2Vec2.0 Framework
Accenture
Mentor: Bhushan Jagyasi
Faculty: Sining Chen
Students: Anh-Vu Nguyen, Sivan Ding, Maxwell Zhou, Alexandria Guo, Julia Wang Antonin Vidon
Deep Learning
View Poster
P39: Using Remote-Sensing Data to Understand Characteristics of Vegetation, such as Species, Health, have several industry applications (1)
IBM
Mentor: Harini Srinivasan
Faculty: Sining Chen
Students: Yunze Pan, Ziyan Liu, Anshuo Wu, Shengdi Chen, Yunshu Cai
Computer Vision
View Poster
P40: Using Remote-Sensing Data to Understand Characteristics of Vegetation, such as Species, Health, have several industry applications (2)
IBM
Mentor: Harini Srinivasan
Faculty: Sining Chen
Students: Zehua Zeng, Tianhao Wu, Juncheng Pan, Zezhong Fan, Yanbing Chen
Computer Vision
View Poster
P41: One-Shot Learning for Face Recognition
JP Morgan & Chase
Mentor: Zhen Zeng; Kassiani Papasotiriou
Faculty: Sining Chen
Students: Pranav Gopal, Aniket Ashutosh Shahane, Vikram Singh, Jannik Jerrit Wiedenhaupt
Computer Vision
View Poster
P42: Creating Multilingual Speech Emotion Recognition Systems JPMorgan (1)
JP Morgan & Chase
Mentor: Akshat Gupta
Faculty: Sining Chen
Students: Siddhant Pravin Mahurkar, Siddhant Rajeev Kumar, Angad Nandwani, Mridul Gupta, Eubin Park
Deep Learning
View Poster
P43: Creating Multilingual Speech Emotion Recognition Systems JPMorgan (2)
JP Morgan & Chase
Mentor: Akshat Gupta
Faculty: Sining Chen
Students: Jingxiang Zhang, Yuxin Cui, Luwei Zhang, Shirley Gui, Ruoxi Liu, Wael Boukhobza
Deep Learning
View Poster
Capstone Faculty
Sining Chen , Adjunct Professor of Industrial Engineering and Operations Research, Columbia University
Adam S. Kelleher , Adjunct Assistant Professor of Computer Science, Columbia University
Yuan (Vivian) Zhang , Department of Biostatistics, Columbia University