SIAM Student Chapter Seminar: Difference between revisions

From UW-Math Wiki
Jump to navigation Jump to search
(updated Haley Kottler's abstract)
 
(94 intermediate revisions by 8 users not shown)
Line 3: Line 3:
*'''When:''' Fridays at 1 PM unless noted otherwise
*'''When:''' Fridays at 1 PM unless noted otherwise
*'''Where:''' 9th floor lounge (we will also broadcast the virtual talks on the 9th floor lounge with refreshments)
*'''Where:''' 9th floor lounge (we will also broadcast the virtual talks on the 9th floor lounge with refreshments)
*'''Organizers:''' [https://sites.google.com/wisc.edu/evan-sorensen Evan Sorensen], Jordan Radke, Peiyi Chen, and Yahui Qu
*'''Organizers:''' Yahui Qu, Peiyi Chen and Zaidan Wu
*'''Faculty advisers:''' [http://www.math.wisc.edu/~jeanluc/ Jean-Luc Thiffeault], [http://pages.cs.wisc.edu/~swright/ Steve Wright]  
*'''Faculty advisers:''' [http://www.math.wisc.edu/~jeanluc/ Jean-Luc Thiffeault], [http://pages.cs.wisc.edu/~swright/ Steve Wright]  
*'''To join the SIAM Chapter mailing list:''' email [mailto:siam-chapter+join@g-groups.wisc.edu siam-chapter+join@g-groups.wisc.edu].
*'''To join the SIAM Chapter mailing list:''' email [mailto:siam-chapter+join@g-groups.wisc.edu siam-chapter+join@g-groups.wisc.edu].
*'''Zoom link:''' https://uwmadison.zoom.us/j/99844791267?pwd=eUFwM25Hc2Roc1kvSzR3N2tVVlpLQT09
*'''Zoom link:''' https://uwmadison.zoom.us/j/97976615799?pwd=U2xFSERIcnR6M1Y1czRmTjQ1bTFJQT09
*'''Passcode: 641156'''
*'''Passcode: 281031'''
 
== Fall 2024 ==


<br>
==Fall 2022==
{| class="wikitable"
{| class="wikitable"
!Date (1 PM unless otherwise noted)
|+
!Date
!Location
!Location
!Speaker
!Speaker
!Title
!Title
|-
|-
|9/23
|10 AM 10/4
|[https://uwmadison.zoom.us/j/99844791267?pwd=eUFwM25Hc2Roc1kvSzR3N2tVVlpLQT09 Virtual] and 911 Van Vleck                         
|Birge 346
|[http://www-personal.umich.edu/~tganders/ Thomas Anderson] (University of Michigan)                        
|Federica Ferrarese (University of Ferrara, Italy)
|A few words on potential theory in modern applied math
|Control plasma instabilities via an external magnetic field: deterministic and uncertain approaches
|-
|-
|9/30 ('''11 AM''')
|11 AM 10/18
|[https://uwmadison.zoom.us/j/99844791267?pwd=eUFwM25Hc2Roc1kvSzR3N2tVVlpLQT09 Virtual] and 911 Van Vleck
|9th floor
|[https://jeffhammond.github.io/ Jeff Hammond] (Principal Engineer at [https://www.nvidia.com/en-us/ NVIDIA])
|Martin Guerra (UW-Madison)
|Industry talk
|Swarm-Based Gradient Descent Meets Simulated Annealing
|-
|-
|10/7
|12:30 PM 10/31
|[https://uwmadison.zoom.us/j/99844791267?pwd=eUFwM25Hc2Roc1kvSzR3N2tVVlpLQT09 Virtual] and 911 Van Vleck
|VV 901
|[https://walterbabyrudin.github.io/ Jie Wang] (Georgia Institute of Technology)
|Chuanqi Zhang (University of Technology Sydney)
|Sinkhorn Distributionally Robust Optimization
|Faster isomorphism testing of p-groups of Frattini class-2
|-
|-
|10/14
|11/8
|[https://uwmadison.zoom.us/j/99844791267?pwd=eUFwM25Hc2Roc1kvSzR3N2tVVlpLQT09 Virtual] and 911 Van Vleck
|9th floor
|[https://you.stonybrook.edu/reutergroup/ Matt Reuter] (Stony Brook University)
|Borong Zhang (UW-Madison)
|Becoming a Ghost Buster
|Solving the Inverse Scattering Problem: Leveraging Symmetries for Machine Learning
|-
|-
|10/19 ('''Wednesday at 4 PM)'''
|11/15
|[https://uwmadison.zoom.us/j/99844791267?pwd=eUFwM25Hc2Roc1kvSzR3N2tVVlpLQT09 Virtual] and 911 Van Vleck
|9th floor
|Ying Li
(zoom)
|Industry talk
|Yantao Wu (Johns Hopkins University)
|Conditional Regression on Nonlinear Variable Model
|-
|-
|10/28
|
|911 Van Vleck
|
|[https://ylzhang2447.github.io/ Yinling Zhang] (UW-Madison)
|
|A Causality-Based Learning Approach for Discovering the Underlying Dynamics of Complex Systems from Partial Observations with Stochastic Parameterization
|
|-
|-
|11/4
|
|911 Van Vleck
|
|Haley Kottler (UW-Madison)
|
|Gaussian Mixture Model Parameter Recovery
|
|-
|-
|11/11
|911 Van Vleck
|[https://sites.google.com/wisc.edu/zinanwang/ Zinan Wang] (UW-Madison)
|
|
|-
|11/18
|911 Van Vleck
|Parvathi Kooloth (UW-Madison)
|
|
|-
|11/25
|NO TALK
|THANKSGIVING WEEK
|
|
|-
|12/2
|[https://uwmadison.zoom.us/j/99844791267?pwd=eUFwM25Hc2Roc1kvSzR3N2tVVlpLQT09 Virtual] and 911 Van Vleck
|Jenny Yeon (Applied Scientist at Amazon)
|
|
|}
|}


==Abstracts==
==Abstracts==
'''October 4th, Federica Ferrarese (University of Ferrara, Italy)''': The study of the problem of plasma confinement in huge devices, such as for example Tokamaks and Stellarators, has attracted a lot of attention in recent years. Strong magnetic fields in these systems can lead to instabilities, resulting in vortex formation. Due to the extremely high temperatures in plasma fusion, physical materials cannot be used for confinement, necessitating the use of external magnetic fields to control plasma density. This approach involves studying the evolution of plasma, made up of numerous particles, using the Vlasov-Poisson equations. In the first part of the talk, the case without uncertainty is explored. Particle dynamics are simulated using the Particle-in-Cell (PIC) method, known for its ability to capture kinetic effects and self-consistent interactions. The goal is to derive an instantaneous feedback control that forces the plasma density to achieve a desired distribution. Various numerical experiments are presented to validate the results. In the second part, uncertainty is introduced into the system, leading to the development of a different control strategy. This method is designed to steer the plasma towards a desired configuration even in the presence of uncertainty. The presentation concludes with a comparison of the two control strategies, supported by various numerical experiments.


'''9/23 Thomas Anderson:''' I'll talk a bit about potential theory as it is used today in the solution, via boundary integral equations / the boundary element method, of linear PDEs. These aren't only a numerical approach: I'll say a few words too about how they can be used to do analysis on problems. Then I may say a few things about volumetric potential theory: what are the problems there I've been thinking about, and application studies in mixing, for example, that they enable. Finally, I'll be happy to talk a bit about my experience so far in academia.
'''October 18th, Martin Guerra (UW-Madison)''': In generic non-convex optimization, one needs to be able to pull samples out of local optimal points to achieve global optimization. Two common strategies are deployed: adding stochasticity to samples such as Brownian motion, as is done in simulated annealing (SA), and employing a swarm of samples to explore the whole landscape, as is done in Swarm-Based Gradient Descent (SBGD). The two strategies have severe drawbacks but complement each other on their strengths. SA fails in the accuracy sense, i.e., finding the exact optimal point, but succeeds in always being able to get close, while SBGD fails in the probability sense, i.e., it has non-trivial probability to fail, but if succeeds, can find the exact optimal point. We propose to combine the strength of the two and develop a swarm-based stochastic gradient method with samples automatically adjusting their annealing. Using mean-field analysis and long-time behavior PDE tools, we can prove the method to succeed in both the accuracy sense and the probability sense. Numerical examples verify these theoretical findings.
 
'''9/30 Jeff Hammond:''' Jeff Hammond is a principal engineer with NVIDIA based in Helsinki, Finland, where his focus is developing better ways to write software for numerical algorithms. From 2014 to 2021, Jeff worked for Intel in Portland, Oregon; he started in the research organization and moved to the data center business group. Prior to that he worked for Argonne National Laboratory, first as a postdoc and then as a scientist in the supercomputing facility. Jeff was a graduate student at the University of Chicago and focused on developing open-source chemistry simulation software with Karol Kowalski at Pacific Northwest National Laboratory.  He majored in chemistry and mathematics at the University of Washington.  Details can be found on Jeff's home page: <nowiki>https://jeffhammond.github.io/</nowiki>. 
 
'''10/7 Jie Wang:''' We study distributionally robust optimization with Sinkhorn distance -- a variant of Wasserstein distance based on entropic regularization. We derive convex programming dual reformulations when the nominal distribution is an empirical distribution and a general distribution, respectively. Compared with Wasserstein DRO, it is computationally tractable for a larger class of loss functions, and its worst-case distribution is more reasonable. We propose an efficient stochastic mirror descent algorithm to solve the dual reformulation with provable convergence guarantees. Finally, we provide various numerical examples using both synthetic and real data to demonstrate its competitive performance and light computation cost.


'''10/12 Matt Reuter:''' As children, most of us didn't know what we wanted to be "when we grew up," and, when asked, might have said, "an astronaut" or "a firefighter." I wanted to be a Ghost Buster and, pragmatically, wound up in computational chemistry and applied mathematics. In this talk, I'll discuss the winding path of my career from school to the national laboratory system to tenure-track faculty to teaching-line faculty. Along the way I’ll discuss my work exorcising (1) numerical ghosts from nanoscience research and (2) psychological ghosts from students when teaching mathematics.
'''October 31st, Chuanqi Zhang''' (University of Technology Sydney): The finite group isomorphism problem asks to decide whether two finite groups of order N are isomorphic. Improving the classical $N^{O(\log N)}$-time algorithm for group isomorphism is a long-standing open problem. It is generally regarded that p-groups of class 2 and exponent p form a bottleneck case for group isomorphism in general. The recent breakthrough by Sun (STOC '23) presents an $N^{O((\log N)^{5/6})}$-time algorithm for this group class. Our work sharpens the key technical ingredients in Sun's algorithm and further improves Sun's result by presenting an $N^{\tilde O((\log N)^{1/2})}$-time algorithm for this group class. Besides, we also extend the result to the more general p-groups of Frattini class-2, which includes non-abelian 2-groups. In this talk, I will present the problem background and our main algorithm in detail, and introduce some connections with other research topics. For example, one intriguing connection is with the maximal and non-commutative ranks of matrix spaces, which have recently received considerable attention in algebraic complexity and computational invariant theory. Results from the theory of Tensor Isomorphism complexity class (Grochow--Qiao, SIAM J. Comput. '23) are utilized to simplify the algorithm and achieve the extension to p-groups of Frattini class-2.  


'''10/19 Ying Li:''' I will talk about my math background and my current role as a quantitative analytics specialist at Wells Fargo. Different types of quantitative analytics specialist at banking field will be generally introduced along with my opinions of the pros and cons for quantitative analytics jobs in financial area as a math student. I will also share my experience from academia to industry and the desired skill sets to be developed for looking for industry jobs.
'''November 8th, Borong Zhang''' (UW-Madison): The inverse scattering problem—reconstructing the properties of an unknown medium by probing it with waves and measuring the medium's response at the boundary—is fundamental in physics and engineering. This talk will focus on how leveraging the symmetries inherent in this problem can significantly enhance machine learning methods for its solution. By incorporating these symmetries into both deterministic neural network architectures and probabilistic frameworks like diffusion models, we achieve more accurate and computationally efficient reconstructions. This symmetry-driven approach reduces the complexity of the models and improves their performance, illustrating how physical principles can inform and strengthen machine learning techniques. Applications demonstrating these benefits will be briefly discussed.  


'''10/28 Yinling Zhang''': Discovering the underlying dynamics of complex systems from data is an important practical topic. In this paper, a new iterative learning algorithm for complex turbulent systems with partial observations is developed that alternates between identifying model structures, recovering unobserved variables, and estimating parameters. First, a causality-based learning approach is utilized for the sparse identification of model structures, which takes into account certain physics knowledge that is pre-learned from data. Next, a systematic nonlinear stochastic parameterization is built to characterize the time evolution of the unobserved variables. Furthermore, the localization of the state variable dependence and the physics constraints are incorporated into the learning procedure. Numerical experiments show that the new algorithm succeeds in identifying the model structure and providing suitable stochastic parameterizations for many complex nonlinear systems.  
'''November 15th, Yantao Wu''' (Johns Hopkins): We consider the problem of estimating the intrinsic structure of composite functions of the type $\mathbb{E} [Y|X] = f(\Pi_\gamma X) $ where $\Pi_\gamma:\mathbb{R}^d\to\mathbb{R}^1$ is the closest point projection operator onto some unknown smooth curve $\gamma: [0, L]\to \mathbb{R}^d$ and  $f: \mathbb{R}^1\to \mathbb{R}^1$ is some unknown  {\it link} function. This model is the generalization of the single-index model where $\mathbb{E}[Y|X]=f(\langle v, X\rangle)$ for some unknown {\it index} vector $v\in\mathbb{S}^{d-1}$. On the other hand, this model is a particular case of function composition model where $\mathbb{E}[Y|X] = f(g(x))$ for some unknown multivariate function $g:\mathbb{R}^d\to\mathbb{R}$. In this paper, we propose an algorithm based on conditional regression and show that under some assumptions restricting the complexity of curve $\gamma$, our algorithm can achieve the one-dimensional optimal minimax rate, plus a curve approximation error bounded by $\mathcal{O}(\sigma_\zeta^2)$. We also perform numerical tests to verify that our algorithm is robust, in the sense that even without some assumptions, the mean squared error can still achieve $\mathcal{O}(\sigma_\zeta^2)$.  


'''11/4 Haley Kottler:''' Gaussian mixture models are an important class of models that arise in many applications.  This talk will introduce these models, and talk about one method of parameter recovery from samples - the method of moments.  I will also discuss some of the challenges that arise in implementation of this method in the multivariate case. 
==Past Semesters==
==Past Semesters==
*[https://wiki.math.wisc.edu/index.php/SIAM_Spring_2024 Spring 2024]
*[[SIAM Fall 2023|Fall 2023]]
*[[SIAM Spring 2023|Spring 2023]]
*[[SIAM Seminar Fall 2022|Fall 2022]]
*[[Spring 2022 SIAM|Spring 2022]]
*[[Spring 2022 SIAM|Spring 2022]]
*[[SIAM Student Chapter Seminar/Fall2021|Fall 2021]]
*[[SIAM Student Chapter Seminar/Fall2021|Fall 2021]]

Latest revision as of 22:40, 13 November 2024


Fall 2024

Date Location Speaker Title
10 AM 10/4 Birge 346 Federica Ferrarese (University of Ferrara, Italy) Control plasma instabilities via an external magnetic field: deterministic and uncertain approaches
11 AM 10/18 9th floor Martin Guerra (UW-Madison) Swarm-Based Gradient Descent Meets Simulated Annealing
12:30 PM 10/31 VV 901 Chuanqi Zhang (University of Technology Sydney) Faster isomorphism testing of p-groups of Frattini class-2
11/8 9th floor Borong Zhang (UW-Madison) Solving the Inverse Scattering Problem: Leveraging Symmetries for Machine Learning
11/15 9th floor

(zoom)

Yantao Wu (Johns Hopkins University) Conditional Regression on Nonlinear Variable Model

Abstracts

October 4th, Federica Ferrarese (University of Ferrara, Italy): The study of the problem of plasma confinement in huge devices, such as for example Tokamaks and Stellarators, has attracted a lot of attention in recent years. Strong magnetic fields in these systems can lead to instabilities, resulting in vortex formation. Due to the extremely high temperatures in plasma fusion, physical materials cannot be used for confinement, necessitating the use of external magnetic fields to control plasma density. This approach involves studying the evolution of plasma, made up of numerous particles, using the Vlasov-Poisson equations. In the first part of the talk, the case without uncertainty is explored. Particle dynamics are simulated using the Particle-in-Cell (PIC) method, known for its ability to capture kinetic effects and self-consistent interactions. The goal is to derive an instantaneous feedback control that forces the plasma density to achieve a desired distribution. Various numerical experiments are presented to validate the results. In the second part, uncertainty is introduced into the system, leading to the development of a different control strategy. This method is designed to steer the plasma towards a desired configuration even in the presence of uncertainty. The presentation concludes with a comparison of the two control strategies, supported by various numerical experiments.

October 18th, Martin Guerra (UW-Madison): In generic non-convex optimization, one needs to be able to pull samples out of local optimal points to achieve global optimization. Two common strategies are deployed: adding stochasticity to samples such as Brownian motion, as is done in simulated annealing (SA), and employing a swarm of samples to explore the whole landscape, as is done in Swarm-Based Gradient Descent (SBGD). The two strategies have severe drawbacks but complement each other on their strengths. SA fails in the accuracy sense, i.e., finding the exact optimal point, but succeeds in always being able to get close, while SBGD fails in the probability sense, i.e., it has non-trivial probability to fail, but if succeeds, can find the exact optimal point. We propose to combine the strength of the two and develop a swarm-based stochastic gradient method with samples automatically adjusting their annealing. Using mean-field analysis and long-time behavior PDE tools, we can prove the method to succeed in both the accuracy sense and the probability sense. Numerical examples verify these theoretical findings.

October 31st, Chuanqi Zhang (University of Technology Sydney): The finite group isomorphism problem asks to decide whether two finite groups of order N are isomorphic. Improving the classical $N^{O(\log N)}$-time algorithm for group isomorphism is a long-standing open problem. It is generally regarded that p-groups of class 2 and exponent p form a bottleneck case for group isomorphism in general. The recent breakthrough by Sun (STOC '23) presents an $N^{O((\log N)^{5/6})}$-time algorithm for this group class. Our work sharpens the key technical ingredients in Sun's algorithm and further improves Sun's result by presenting an $N^{\tilde O((\log N)^{1/2})}$-time algorithm for this group class. Besides, we also extend the result to the more general p-groups of Frattini class-2, which includes non-abelian 2-groups. In this talk, I will present the problem background and our main algorithm in detail, and introduce some connections with other research topics. For example, one intriguing connection is with the maximal and non-commutative ranks of matrix spaces, which have recently received considerable attention in algebraic complexity and computational invariant theory. Results from the theory of Tensor Isomorphism complexity class (Grochow--Qiao, SIAM J. Comput. '23) are utilized to simplify the algorithm and achieve the extension to p-groups of Frattini class-2.

November 8th, Borong Zhang (UW-Madison): The inverse scattering problem—reconstructing the properties of an unknown medium by probing it with waves and measuring the medium's response at the boundary—is fundamental in physics and engineering. This talk will focus on how leveraging the symmetries inherent in this problem can significantly enhance machine learning methods for its solution. By incorporating these symmetries into both deterministic neural network architectures and probabilistic frameworks like diffusion models, we achieve more accurate and computationally efficient reconstructions. This symmetry-driven approach reduces the complexity of the models and improves their performance, illustrating how physical principles can inform and strengthen machine learning techniques. Applications demonstrating these benefits will be briefly discussed.

November 15th, Yantao Wu (Johns Hopkins): We consider the problem of estimating the intrinsic structure of composite functions of the type $\mathbb{E} [Y|X] = f(\Pi_\gamma X) $ where $\Pi_\gamma:\mathbb{R}^d\to\mathbb{R}^1$ is the closest point projection operator onto some unknown smooth curve $\gamma: [0, L]\to \mathbb{R}^d$ and  $f: \mathbb{R}^1\to \mathbb{R}^1$ is some unknown  {\it link} function. This model is the generalization of the single-index model where $\mathbb{E}[Y|X]=f(\langle v, X\rangle)$ for some unknown {\it index} vector $v\in\mathbb{S}^{d-1}$. On the other hand, this model is a particular case of function composition model where $\mathbb{E}[Y|X] = f(g(x))$ for some unknown multivariate function $g:\mathbb{R}^d\to\mathbb{R}$. In this paper, we propose an algorithm based on conditional regression and show that under some assumptions restricting the complexity of curve $\gamma$, our algorithm can achieve the one-dimensional optimal minimax rate, plus a curve approximation error bounded by $\mathcal{O}(\sigma_\zeta^2)$. We also perform numerical tests to verify that our algorithm is robust, in the sense that even without some assumptions, the mean squared error can still achieve $\mathcal{O}(\sigma_\zeta^2)$.

Past Semesters