Optimization by space transformation and decomposition

Gratton, S; Vicente, LN; Zhang, Z

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/92512

DC Field	Value	Language
dc.contributor	Department of Applied Mathematics	en_US
dc.creator	Gratton, S	en_US
dc.creator	Vicente, LN	en_US
dc.creator	Zhang, Z	en_US
dc.date.accessioned	2022-04-12T08:43:29Z	-
dc.date.available	2022-04-12T08:43:29Z	-
dc.identifier.uri	http://hdl.handle.net/10397/92512	-
dc.language.iso	en	en_US
dc.rights	Posted with permission of the author.	en_US
dc.subject	Space transformation	en_US
dc.subject	Space decomposition	en_US
dc.subject	Coarse space	en_US
dc.subject	Parallel	en_US
dc.subject	Inaccurate gradient	en_US
dc.subject	Trust-region methods	en_US
dc.subject	Worst-case complexity	en_US
dc.title	Optimization by space transformation and decomposition	en_US
dc.type	Preprint	en_US
dcterms.abstract	We introduce a space transformation framework for unconstrained optimization of a function f that is smooth but not necessarily convex. At each iteration of the framework, the gradient of f is mapped by a forward transformation to an auxiliary (possibly lifted) space, in which a model is established based on the transformed gradient and a step is obtained by minimizing this model. A backward transformation then maps the step back to the original space, where the iterate is updated accordingly. Using a trust-region globalization scheme, and by inspection of the consistency between the forward and backward transformations, the framework is guaranteed to converge globally to first-order criticality with an O(E 2) worstcase iteration complexity to reach a gradient with norm smaller than E > 0. The complexity is improved to O(E 1) and O(log(E 1)) when f is convex and strongly convex respectively.	en_US
dcterms.abstract	The space transformation framework can be directly specialized to a parallel space decomposition framework for nonlinear optimization, which can be regarded as an extension of the parallel Schwarz domain decomposition method for PDEs and linear systems. Such a decomposition framework is motivated by problems that cannot be directly solved (or even saved) in the full space and hence divide-and-conquer is needed. As in domain decomposition, introducing overlaps between subspaces can improve the performance of space decomposition methods provided that overlaps are handled properly. Our space decomposition framework covers a wide range of overlap-handling strategies, including Restricted Additive Schwarz, Weighted Restricted Additive Schwarz, and Additive Schwarz with Harmonic Extension, all of which have achieved remarkable success in domain decomposition. Similar to the coarse grid techniques in domain decomposition, we incorporate a coarse space correction into our framework. An unsophisticated implementation of the framework works quite well in our numerical experiments. With Restricted Additive Schwarz and coarse space correction, the algorithm scales nicely when the number of subspaces increases.	en_US
dcterms.abstract	At the same time, the space transformation framework can be specialized to analyze trustregion methods when the gradient of f is evaluated inaccurately. Applying the theory of the space transformation framework, we provide sharp bounds on the gradient inaccuracy that guarantee global convergence of trust-region methods, and prove new global convergence results while recovering some existing ones. More importantly, we establish a complete theory of worst-case complexities for trust-region methods based on inaccurate gradients. If the gradient inaccuracy respects the aforementioned bounds, these complexities are of the same order as when the gradients are accurate, and the gradient inaccuracy only enlarges the multiplicative constants in them by a mild magnitude that is entirely decided by trust-region algorithmic parameters and independent of the optimization problem. On the other hand, once the gradient inaccuracy exceeds such bounds, the behavior of trust-region methods deteriorates dramatically, which is demonstrated by our numerical experiment. Our theory provides theoretical insights into the tight connection between algorithmic parameters and the amount of relative gradient error that trust-region methods can tolerate. We show that such a connection is clearly observable in computation. This enables us to propose parameters that are well adapted to a given accuracy level of the gradient.	en_US
dcterms.abstract	[Figure not available: see fulltext.]	en_US
dcterms.accessRights	open access	en_US
dcterms.issued	2019	-
dc.description.validate	202204 bcwh	en_US
dc.description.oa	Author’s Original	en_US
dc.identifier.FolderNumber	RGC-B1-178	-
dc.description.fundingSource	RGC	en_US
dc.description.pubStatus	Unpublish	en_US
dc.description.oaCategory	Copyright retained by author	en_US
Appears in Collections:	Other

Files in This Item:

File	Description	Size	Format
Gratton_Optimizatiby_Space_Transformatidecomposition.pdf	Preprint version	904.96 kB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Author’s Original

Access

View full-text via PolyU eLinks

Show simple item record

Page views

129

Last Week
0

Last month

Citations as of Apr 14, 2025

Downloads

93

Citations as of Apr 14, 2025

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

Page views

Downloads

Google ScholarTM

Google Scholar^TM