Middle East Technical University
Year in School
REU Faculty Mentor
Research Area Interest
ML Pipeline Testing
Biography & Research Abstract
Developers of ML libraries often write end-to-end/integration tests that check the validity of the ML pipeline – this includes loading the data, initializing the model/algorithm, training the model, and asserting that some metrics are above a pre-selected threshold. To make testing cost-efficient, developers often use a smaller/toy version of the data (selected from the training dataset). However, it is currently unknown if the choice of data is optimal, i.e., whether it is sufficient to test that the model is learning something useful and catches potential “accuracy” bugs. Hence, the goal of the project is to study and understand the role of training “data” used for testing various ML algorithms/models and their implementations. We aim to study and validate the following Hypotheses/Questions:
- Given a (training) dataset D for a model, can we derive a smaller dataset D’ that has (1) a similar accuracy (say within ~5%) as original data, (2) has high fault-detection ability, and (3) can be trained with within a reasonable time (for CI)?
- Can we validate whether the model is learning something “useful” (e.g. via some relevant metrics) with the small dataset (derived by us/developer) used in the test?
- Can we develop data attacks that perturb the existing dataset and evaluate how that impacts the test in terms of, for instance, (1) its passing probability, (2) fault-detection ability, and (3) code coverage?
- Can we develop some interpretability metrics with respect to how the models are tested?
My name is Süleyman AteÅŸ, and I am from Turkey. I am currently sophomore METU Computer Engineering student. I love computer science and programming, and I have deep interests in the field of deep learning. Other than that, I like watching and playing football, and trekking with my friends at our campus. I like meeting new people, and getting to learn about the cultures and traditions that are new to me.