iCubed (Institute, Industry, Innovation) seminars invite DSI Industry Affiliates to give technical talks on work going on in their domain. Join to learn about real-world uses of data science and opportunities with Industry Affiliates.


Guest Speaker

Daniel Preoțiuc-Pietro, Senior Research Scientist, Bloomberg

Moderated By: John Hyde, DSI Assistant Director of Career Development and Alumni Services


Details

Wednesday, December 1 (5:30 PM – 6:30 PM ET) – Virtual


Abstract & Biography

Applied Named Entity Recognition @ Bloomberg

Bloomberg deals with a wealth of heterogeneous data, including (but not limited to): reports from financial analysts, earnings releases, company filings, news stories, web scrapes, social media posts, ticker symbols, and pricing information of a wide variety of securities. With 80% of financial data in unstructured formats, the ability to quickly identify named entities automatically is essential to understanding this content.

We will briefly introduce Bloomberg’s products that use this technology and present the unique challenges related to Bloomberg’s data and business. We will then present four of our recent publications that aim to tackle the following challenges: heterogeneity of content, temporal data drift, ensuring high precision in tagging or tagging entities as they are typed.

Bio: Daniel Preoțiuc-Pietro is a Senior Research Scientist at Bloomberg, where he leads the core NLP group that powers models for processing news, social media and financial documents. His research interests are focused on understanding the social and temporal aspects of text, especially from social media, with applications in domains such as Social Psychology, Law, Political Science and Journalism. Several of his research studies were featured in popular press including the Washington Post, BBC, New Scientist, Scientific American or FiveThirtyEight. He is a co-organizer of the Natural Legal Language Processing workshop series since its inception. Prior to joining Bloomberg LP, Daniel was a postdoctoral researcher at the University of Pennsylvania with the interdisciplinary World Well Being Project and obtained his PhD in Natural Language Processing and Machine Learning at the University of Sheffield, UK.