Eugenie Y. Lai

Logo

Contact: eugenie.y.lai [at] gmail.com
GitHub: eugenieshine
Twitter: @EugenieLai
CV, transcript

News [More Updates]
2021.04 Joining the Data Systems Group (DSG) at MIT EECS CSAIL as a PhD student in Fall '21.

Theme by orderedlist

Iā€™m a senior undergraduate student in the Combined Major of Business and Computer Science (BUCS) program at the University of British Columbia (UBC). I have been working with Dr. Rachel Pottinger since Spring 2019 and was previously advised by Dr. Raymond Ng.

My current research focuses on data systems while applying visualization and machine learning to help users interact with and make sense of data. Today, databases provide a vital infrastructure for users to access high volumes of data, but both field-specific and database-related expertise are required for a user to interact with database applications. Such use cases motivate my work on facilitating user interaction with data systems.

Projects

Sequence-Aware Query Recommendation Using Deep Learning

Users interact with database management systems by writing sequences of queries. Those sequences encode important information. Current SQL query recommendation approaches do not take that sequence into consideration. Our work presents a novel sequence-aware approach to query recommendation. We use deep learning prediction models trained on query sequences extracted from large-scale query workloads to build our approach. We present users with contextual (query fragments) and structural (query templates) information that can aid them in formulating their next query. We thoroughly analyze query sequences in two real-world query workloads, the Sloan Digital Sky Survey (SDSS) and the SQLShare workload. Empirical results show that the sequence-aware, deep-learning approach outperforms methods that do not use sequence information. [Manuscript submitted to VLDB ā€˜21] [Poster]

PastWatch

Pastwatch helps users understand query answers by summarizing, explaining, and visualizing query provenance. Data provenance is any information about the origin of data and the process that leads to its creation. The provenance of a query over a database is a subset of the data in the database that contributed to the query answer. While comprehensive, query provenance consists of large volumes of data and hence is overwhelming for users to explore. We present an approach to provenance exploration that builds on data summarization techniques and provides an interface to visualize the summary.

Publications

Summarizing Provenance of Aggregation Query Results in Relational Databases [Short Paper]. To Appear in IEEE ICDE ā€˜21.
Omar AlOmeir, Eugenie Y. Lai, Mostafa Milani, and Rachel Pottinger.

Pastwatch: On the Usability of Provenance Data in Relational Databases [Short Paper]. IEEE ICDE ā€˜20: 1882-1885.
Omar AlOmeir, Eugenie Y. Lai, Mostafa Milani, and Rachel Pottinger.

Other

Blog for fun.

Miscellaneous to de-stress.