Normal view MARC view ISBD view

Minimum divergence methods in statistical machine learning (Record no. 567557)

MARC details
000 -LEADER
fixed length control field	04074 a2200205 4500
003 - CONTROL NUMBER IDENTIFIER
control field	OSt
005 - DATE AND TIME OF LATEST TRANSACTION
control field	20250724164040.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	250704b \|\|\|\|\|\|\|\| \|\|\|\| 00\| 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
ISBN	9784431569206
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	006.31
Item number	Eg88m
100 ## - MAIN ENTRY--AUTHOR NAME
Personal name	Eguchi, Shinto
245 ## - TITLE STATEMENT
Title	Minimum divergence methods in statistical machine learning
Remainder of title	from an information geometric viewpoint
Statement of responsibility, etc	Shinto Eguchi
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Name of publisher	Springer
Year of publication	2022
Place of publication	Japan
300 ## - PHYSICAL DESCRIPTION
Number of Pages	x, 221p
520 ## - SUMMARY, ETC.
Summary, etc	This book explores minimum divergence methods of statistical machine learning for estimation, regression, prediction, and so forth, in which we engage in information geometry to elucidate their intrinsic properties of the corresponding loss functions, learning algorithms, and statistical models. One of the most elementary examples is Gauss's least squares estimator in a linear regression model, in which the estimator is given by minimization of the sum of squares between a response vector and a vector of the linear subspace hulled by explanatory vectors. This is extended to Fisher's maximum likelihood estimator (MLE) for an exponential model, in which the estimator is provided by minimization of the Kullback-Leibler (KL) divergence between a data distribution and a parametric distribution of the exponential model in an empirical analogue. Thus, we envisage a geometric interpretation of such minimization procedures such that a right triangle is kept with Pythagorean identity in the sense of the KL divergence. This understanding sublimates a dualistic interplay between a statistical estimation and model, which requires dual geodesic paths, called m-geodesic and e-geodesic paths, in a framework of information geometry.<br/>We extend such a dualistic structure of the MLE and exponential model to that of the minimum divergence estimator and the maximum entropy model, which is applied to robust statistics, maximum entropy, density estimation, principal component analysis, independent component analysis, regression analysis, manifold learning, boosting algorithm, clustering, dynamic treatment regimes, and so forth. We consider a variety of information divergence measures typically including KL divergence to express departure from one probability distribution to another. An information divergence is decomposed into the cross-entropy and the (diagonal) entropy in which the entropy associates with a generative model as a family of maximum entropy distributions; the cross entropy associates with a statistical estimation method via minimization of the empirical analogue based on given data. Thus any statistical divergence includes an intrinsic object between the generative model and the estimation method. Typically, KL divergence leads to the exponential model and the maximum likelihood estimation. It is shown that any information divergence leads to a Riemannian metric and a pair of the linear connections in the framework of information geometry.<br/>We focus on a class of information divergence generated by an increasing and convex function U, called U-divergence. It is shown that any generator function U generates the U-entropy and U-divergence, in which there is a dualistic structure between the U-divergence method and the maximum U-entropy model. We observe that a specific choice of U leads to a robust statistical procedurevia the minimum U-divergence method. If U is selected as an exponential function, then the corresponding U-entropy and U-divergence are reduced to the Boltzmann-Shanon entropy and the KL divergence; the minimum U-divergence estimator is equivalent to the MLE. For robust supervised learning to predict a class label we observe that the U-boosting algorithm performs well for contamination of mislabel examples if U is appropriately selected. We present such maximal U-entropy and minimum U-divergence methods, in particular, selecting a power function as U to provide flexible performance in statistical machine learning.<br/><br/>
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical Term	Machine learning
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical Term	Statistical methods
700 ## - ADDED ENTRY--PERSONAL NAME
Personal name	Komori, Osamu
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Koha item type	Books

Holdings
Withdrawn status	Lost status	Damaged status	Not for loan	Collection code	Home library	Current library	Date acquired	Source of acquisition	Cost, normal purchase price	Full call number	Accession Number	Cost, replacement price	Koha item type
				On Display	PK Kelkar Library, IIT Kanpur	PK Kelkar Library, IIT Kanpur	24/07/2025	2	10528.65	006.31 Eg88m	A186928	14038.20	Books

Place hold
Print
Add to your cart (remove)
Save record
BIBTEX Dublin Core ISBD MARC (non-Unicode/MARC-8) RIS MARC (Unicode/UTF-8)
More searches

Search for this title in:
Other Libraries (WorldCat) Other Databases (Google Scholar) Online Stores (Bookfinder.com) Open Library (openlibrary.org)

Welcome to P K Kelkar Library, Online Public Access Catalogue (OPAC)

Minimum divergence methods in statistical machine learning (Record no. 567557)