Eugenie Y. Lai

Logo

Email: eugenie.y.lai [at] gmail.com
GitHub: eugenieshine
Twitter: @EugenieLai
CV, transcript, timeline

I’m a senior undergraduate student in the Combined Major of Business and Computer Science (BUCS) program at the University of British Columbia (UBC) Sauder School of Business. I’m a research assistant at the UBC Data Management and Mining Lab, supervised by Dr. Rachel Pottinger since Spring 2019. In Summer 2019, I was supervised by Dr. Raymond Ng in the UBC Data Science for Social Good (DSSG) program.

My current research focuses on databases while applying concepts of visualization and machine learning to help users interact with and make sense of data. Today, database systems provide a vital infrastructure for users to access high volumes of data in a variety of applications. However, both field-specific and database-related expertise are required for a user to interact with such database applications. Seeing the user-database barriers sparks my urge to centre my work around the theme of facilitating user interaction with databases, especially in knowledge exploration.

NEWS: I did a poster presentation at the National Collegiate Research Conference (NCRC) held by the Harvard College Undergraduate Research Association (HCURA) in Jan. 2021.

Research Projects

Sequence-Aware Query Recommendation Using Deep Learning

Users interact with database management systems by writing sequences of queries. Those sequences encode important information. Current SQL query recommendation approaches do not take that sequence into consideration. Our work presents a novel sequence-aware approach to query recommendation. We use deep learning prediction models trained on query sequences extracted from large-scale query workloads to build our approach. We present users with contextual (query fragments) and structural (query templates) information that can aid them in formulating their next query. We thoroughly analyze query sequences in two real-world query workloads, the Sloan Digital Sky Survey (SDSS) and the SQLShare workload. Empirical results show that the sequence-aware, deep-learning approach outperforms methods that do not use sequence information. [Submitted to VLDB ‘21] [Manuscript]

PastWatch

Pastwatch helps users understand query answers by summarizing, explaining, and visualizing query provenance. Data provenance is any information about the origin of data and the process that leads to its creation. The provenance of a query over a database is a subset of the data in the database that contributed to the query answer. While comprehensive, query provenance consists of large volumes of data and hence is overwhelming for users to explore. We present an approach to provenance exploration that builds on data summarization techniques and provides an interface to visualize the summary.

Publications

Summarizing Provenance of Aggregation Query Results in Relational Databases [Short Paper]. To Appear in IEEE ICDE ‘21.
Omar AlOmeir, Eugenie Y. Lai, Mostafa Milani, and Rachel Pottinger.

Pastwatch: On the Usability of Provenance Data in Relational Databases [Short Paper]. IEEE ICDE ‘20: 1882-1885.
Omar AlOmeir, Eugenie Y. Lai, Mostafa Milani, and Rachel Pottinger.

Blog

Things helped me discover my research interests.

Miscellaneous

Things help me destress.