Machine learning model selection with multi-objective Bayesian optimization and reinforcement learning

www.lmu.de | UB | Blättern | FAQ

Zur erweiterten Suche

English

Zur erweiterten Suche

Machine learning model selection with multi-objective Bayesian optimization and reinforcement learning. case studies on functional data analysis, pipeline tuning and shifted distribution

A machine learning system, including when used in reinforcement learning, is usually fed with only limited data, while aimed at training a model with good predictive performance that can generalize to an underlying data distribution. Within certain hypothesis classes, model selection chooses a model based on selection criteria calculated from available data, which usually serve as estimators of generalization performance of the model. One major challenge for model selection that has drawn increasing attention is the discrepancy between the data distribution where training data is sampled from and the data distribution at deployment. The model can over-fit in the training distribution, and fail to extrapolate in unseen deployment distributions, which can greatly harm the reliability of a machine learning system. Such a distribution shift challenge can become even more pronounced in high-dimensional data types like gene expression data, functional data and image data, especially in a decentralized learning scenario. Another challenge for model selection is efficient search in the hypothesis space. Since training a machine learning model usually takes a fair amount of resources, searching for an appropriate model with favorable configurations is by inheritance an expensive process, thus calling for efficient optimization algorithms. To tackle the challenge of distribution shift, novel resampling methods for the evaluation of robustness of neural network was proposed, as well as a domain generalization method using multi-objective bayesian optimization in decentralized learning scenario and variational inference in a domain unsupervised manner. To tackle the expensive model search problem, combining bayesian optimization and reinforcement learning in an interleaved manner was proposed for efficient search in a hierarchical conditional configuration space. Additionally, the effectiveness of using multi-objective bayesian optimization for model search in a decentralized learning scenarios was proposed and verified. A model selection perspective to reinforcement learning was proposed with associated contributions in tackling the problem of exploration in high dimensional state action spaces and sparse reward. Connections between statistical inference and control was summarized. Additionally, contributions in open source software development in related machine learning sub-topics like feature selection and functional data analysis with advanced tuning method and abundant benchmarking were also made.

Not available

Sun, Xudong

30. Sep. 2021

2021

Englisch

Universitätsbibliothek der Ludwig-Maximilians-Universität München

https://nbn-resolving.org/urn:nbn:de:bvb:19-286698

Sun, Xudong (2021): Machine learning model selection with multi-objective Bayesian optimization and reinforcement learning: case studies on functional data analysis, pipeline tuning and shifted distribution. Dissertation, LMU München: Fakultät für Mathematik, Informatik und Statistik

Vorschau

PDF
Sun_Xudong.pdf
8MB

DOI: 10.5282/edoc.28669

URN: urn:nbn:de:bvb:19-286698

Abstract

Dokumententyp:	Dissertationen (Dissertation, LMU München)
Themengebiete:	300 Sozialwissenschaften 300 Sozialwissenschaften > 310 Statistik
Fakultäten:	Fakultät für Mathematik, Informatik und Statistik
Sprache der Hochschulschrift:	Englisch
Datum der mündlichen Prüfung:	30. September 2021
1. Berichterstatter:in:	Bischl, Bernd
MD5 Prüfsumme der PDF-Datei:	771aad5d3251eb4edbc50140c8271974
Signatur der gedruckten Ausgabe:	0001/UMC 28262
ID Code:	28669
Eingestellt am:	13. Oct. 2021 13:20
Letzte Änderungen:	13. Oct. 2021 13:56

Nur für Administratoren und Editoren: Dokument bearbeiten