Maximilian Dax
Machine Learning Researcher and Physicist

EmailLinkTwitterGitHub

Hi!  I'm a PhD candidate at the MPI for Intelligent Systems (Tübingen, Germany), advised by Bernhard Schölkopf. I'm broadly interested in machine learning, with a focus on probabilistic modeling and density estimation for astrophysics. Together with my collaborators, I have developed the machine learning method "Dingo" for fast and highly accurate Bayesian inference of gravitational-wave data. 

I have studied physics in Bonn (Germany) and Uppsala (Sweden), with a research focus on theoretical particle physics. During my master studies, I spent some time at Bosch doing computer vision research.

Highlighted publications

Maximilian Dax*, Stephen R Green*, Jonathan Gair, Michael Pürrer, Jonas Wildberger, Jakob H. Macke, Alessandra Buonanno, Bernhard Schölkopf (2022). Neural Importance Sampling for Rapid and Reliable Gravitational-Wave Inference. preprint [pdf] 

Neural Importance Sampling for Rapid and Reliable Gravitational-Wave Inference

We combine amortized neural posterior estimation with importance sampling for fast and accurate gravitational-wave inference. We first generate a rapid proposal for the Bayesian posterior using neural networks, and then attach importance weights based on the underlying likelihood and prior. This provides (1) a corrected posterior free from network inaccuracies, (2) a performance diagnostic (the sample efficiency) for assessing the proposal and identifying failure cases, and (3) an unbiased estimate of the Bayesian evidence. By establishing this independent verification and correction mechanism we address some of the most frequent criticisms against deep learning for scientific inference. We carry out a large study analyzing 42 binary black hole mergers observed by LIGO and Virgo with the SEOBNRv4PHM and IMRPhenomXPHM waveform models. This shows a median sample efficiency of ≈ 10% (two orders-of-magnitude better than standard samplers) as well as a ten-fold reduction in the statistical uncertainty in the log evidence. Given these advantages, we expect a significant impact on gravitational-wave inference, and for this approach to serve as a paradigm for harnessing deep learning methods in scientific applications.

Maximilian Dax, Stephen R Green, Jonathan Gair, Michael Deistler, Bernhard Schölkopf, Jakob H. Macke (2022).  Group equivariant neural posterior estimation. ICLR 2022 [pdf] 

Group equivariant neural posterior estimation

Simulation-based inference with conditional neural density estimators is a powerful approach to solving inverse problems in science. However, these methods typically treat the underlying forward model as a black box, with no way to exploit geometric properties such as equivariances. Equivariances are common in scientific models, however integrating them directly into expressive inference networks (such as normalizing flows) is not straightforward. We here describe an alternative method to incorporate equivariances under joint transformations of parameters and data. Our method—called group equivariant neural posterior estimation (GNPE)—is based on self-consistently standardizing the “pose” of the data while estimating the posterior over parameters. It is architecture-independent, and applies both to exact and approximate equivariances. As a real-world application, we use GNPE for amortized inference of astrophysical binary black hole systems from gravitational- wave observations. We show that GNPE achieves state-of-the-art accuracy while reducing inference times by three orders of magnitude.

Maximilian Dax, Stephen R. Green, Jonathan Gair, Jakob H. Macke, Alessandra Buonanno, Bernhard Schölkopf (2021).  Real-Time Gravitational Wave Science with Neural Posterior Estimation. Phys. Rev. Lett., 127, 241103 [pdf] [press]

Real-Time Gravitational Wave Science with Neural Posterior Estimation

We demonstrate unprecedented accuracy for rapid gravitational wave parameter estimation with deep learning. Using neural networks as surrogates for Bayesian posterior distributions, we analyze eight gravitational wave events from the first LIGO-Virgo Gravitational-Wave Transient Catalog and find very close quantitative agreement with standard inference codes, but with inference times reduced from O(day) to 20 s per event. Our networks are trained using simulated data, including an estimate of the detector noise characteristics near the event. This encodes the signal and noise models within millions of neural-network parameters and enables inference for any observed data consistent with the training distribution, accounting for noise nonstationarity from event to event. Our algorithm—called “DINGO”—sets a new standard in fast and accurate inference of physical parameters of detected gravitational wave events, which should enable real-time data analysis without sacrificing accuracy.

Maximilian Dax, Dominik Stamen, Bastian Kubis (2021).  Dispersive analysis of the Primakoff reaction γK→Kπ . Eur.Phys.J.C 81, 221 [pdf] 

Dispersive analysis of the Primakoff reaction γK→Kπ

We provide a dispersion-theoretical representation of the reaction amplitudes γK → Kπ in all charge channels, based on modern pion–kaon P-wave phase shift input. Crossed-channel singularities are fixed from phenomenology as far as possible. We demonstrate how the subtraction constants can be matched to a low-energy theorem and radiative couplings of the K*(892) resonances, thereby providing a model-independent framework for future analyses of high-precision kaon Primakoff data.

Andong Tan, Duc Tam Nguyen, Maximilian Dax, Matthias Nießner, Thomas Brox (2021).  Explicitly Modeled Attention Maps for Image Classification . AAAI 2021 [pdf] 

Explicitly Modeled Attention Maps for Image Classification

Self-attention networks have shown remarkable progress in computer vision tasks such as image classification. The main benefit of the self-attention mechanism is the ability to capture long-range feature interactions in attention-maps. However, the computation of attention-maps requires a learnable key, query, and positional encoding, whose usage is often not intuitive and computationally expensive. To mitigate this problem, we propose a novel self-attention module with explicitly modeled attention-maps using only a single learnable parameter for low computational overhead. The design of explicitly modeled attention-maps using geometric prior is based on the observation that the spatial context for a given pixel within an image is mostly dominated by its neighbors, while more distant pixels have a minor contribution. Concretely, the attention-maps are parametrized via simple functions (e.g., Gaussian kernel) with a learnable radius, which is modeled independently of the input content. Our evaluation shows that our method achieves an accuracy improvement of up to 2.2% over the ResNet-baselines in ImageNet ILSVRC and outperforms other self-attention methods such as AA-ResNet152 in accuracy by 0.9% with 6.4% fewer parameters and 6.7% fewer GFLOPs. This result empirically indicates the value of incorporating geometric prior into self-attention mechanism when applied in image classification.

Duc Tam Nguyen*, Maximilian Dax*, Chaithanya Kumar Mummadi, Thi Phuong Nhung Ngo, Thi Hoai Phuong Nguyen, Zhongyu Lou, Thomas Brox (2019).  DeepUSPS: Deep Robust Unsupervised Saliency Prediction With Self-Supervision. NeurIPS 2019 [pdf] 

DeepUSPS: Deep Robust Unsupervised Saliency Prediction With Self-Supervision

Deep neural network (DNN) based salient object detection in images based on high-quality labels is expensive. Alternative unsupervised approaches rely on careful selection of multiple handcrafted saliency methods to generate noisy pseudo-ground-truth labels. In this work, we propose a two-stage mechanism for robust unsupervised object saliency prediction, where the first stage involves refinement of the noisy pseudo labels generated from different handcrafted methods. Each handcrafted method is substituted by a deep network that learns to generate the pseudo labels. These labels are refined incrementally in multiple iterations via our proposed self-supervision technique. In the second stage, the refined labels produced from multiple networks representing multiple saliency methods are used to train the actual saliency detection network. We show that this self-learning procedure outperforms all the existing unsupervised methods over different datasets. Results are even comparable to those of fully-supervised state-of-the-art approaches.

Maximilian Dax, Tobias Isken, Bastian Kubis (2018).  Quark-mass dependence in ω→3π  decays. Eur.Phys.J.C 78, 859 [pdf] 

Quark-mass dependence in ω→3π  decays

We study the quark-mass dependence of ω→3π  decays, based on a dispersion-theoretical framework. We rely on the quark-mass-dependent scattering phase shift for the pion-pion P-wave extracted from unitarized chiral perturbation theory. The dispersive representation then takes into account the final-state rescattering among all three pions. The described formalism may be used as an extrapolation tool for lattice QCD calculations of three-pion decays, for which ω→3π  can serve as a paradigm case.

Imprint / Provider Identification

The following provides mandatory data concerning the provider of this website, obligations with regard to data protection, as well as other important legal references involving this website as required by German law.

Provider

The provider of this Internet site within the legal meaning of the term is Maximilian Dax.

Address

Maximilian Dax

Max-Planck-Ring 4

72076 Tübingen

Germany

https://ei.is.tuebingen.mpg.de/person/mdax

Editors

Responsible editors for the contents of this website with regard to media law:

Maximilian Dax

Max-Planck-Ring 4

72076 Tübingen

Germany

For content related questions please contact firstname.lastname[at]tuebingen.mpg.de

Liability for Contents of Online Information

As the provider of contents in accordance with Section 7 Paragraph 1 of the Tele-Media Law, Maximilian Dax shall be responsible for any contents which it makes available for use in accordance with general legal provisions. Maximilian Dax makes every effort to provide timely and accurate information on this Web site. Nevertheless, errors and inaccuracies cannot be completely ruled out. Therefore, Maximilian Dax does not assume any liability for the relevance, accuracy, completeness or quality of the information provided. Maximilian Dax shall not be liable for damage of a tangible or intangible nature caused directly or indirectly through the use or failure to use the information offered and/or through the use of faulty or incomplete information unless it is verifiably culpable of intent or gross negligence. The same shall apply to any downloadable software available free of charge. Maximilian Dax reserves the right to modify, supplement, or delete any or all of the information offered on its Internet site, or to temporarily or permanently cease publication thereof without prior and separate notification.

Links to Internet Sites of Third Parties

This Internet site includes links to external pages. These external links are designated as follows: The respective provider shall be responsible for the contents of any linked external pages. In establishing the initial link, Maximilian Dax has reviewed the respective external content in order to determine whether such link entailed possible civil or criminal responsibility. However, a constant review of linked external pages is unreasonable without concrete reason to believe that a violation of the law may be involved. If Maximilian Dax determines such or it is pointed out by others that an external offer to which it is connected via a link entails civil or criminal responsibility, then Maximilian Dax will immediately eliminate any link to this offer. Maximilian Dax expressly dissociates itself from such contents.

Copyright

The images on the homepage are protected by copyright law. Maximilian Dax has been granted permission by the relevant entities.
Copyright for portrait photo: MPI for Intelligent Systems / W. Scheible


Opinions expressed on this website are my own and have no relation to my employer.