ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.01657
46
0
v1v2v3 (latest)

Robust estimation of a regression function in exponential families

3 November 2020
Y. Baraud
Juntong Chen
ArXiv (abs)PDFHTML
Abstract

We observe nnn pairs of independent (but not necessarily i.i.d.) random variables X1=(W1,Y1),…,Xn=(Wn,Yn)X_{1}=(W_{1},Y_{1}),\ldots,X_{n}=(W_{n},Y_{n})X1​=(W1​,Y1​),…,Xn​=(Wn​,Yn​) and tackle the problem of estimating the conditional distributions Qi⋆(wi)Q_{i}^{\star}(w_{i})Qi⋆​(wi​) of YiY_{i}Yi​ given Wi=wiW_{i}=w_{i}Wi​=wi​ for all i∈{1,…,n}i\in\{1,\ldots,n\}i∈{1,…,n}. Even though these might not be true, we base our estimator on the assumptions that the data are i.i.d.\ and the conditional distributions of YiY_{i}Yi​ given Wi=wiW_{i}=w_{i}Wi​=wi​ belong to a one parameter exponential family Qˉ\bar{\mathscr{Q}}Qˉ with parameter space given by an interval III. More precisely, we pretend that these conditional distributions take the form Qθ(wi)∈QˉQ_{{\boldsymbol{\theta}}(w_{i})}\in \bar{\mathscr{Q}}Qθ(wi​)​∈Qˉ for some θ{\boldsymbol{\theta}}θ that belongs to a VC-class Θˉ\bar{\boldsymbol{\Theta}}Θˉ of functions with values in III. For each i∈{1,…,n}i\in\{1,\ldots,n\}i∈{1,…,n}, we estimate Qi⋆(wi)Q_{i}^{\star}(w_{i})Qi⋆​(wi​) by a distribution of the same form, i.e.\ Qθ^(wi)∈QˉQ_{\hat{\boldsymbol{\theta}}(w_{i})}\in \bar{\mathscr{Q}}Qθ^(wi​)​∈Qˉ, where θ^=θ^(X1,…,Xn)\hat {\boldsymbol{\theta}}=\hat {\boldsymbol{\theta}}(X_{1},\ldots,X_{n})θ^=θ^(X1​,…,Xn​) is a well-chosen estimator with values in Θˉ\bar{\boldsymbol{\Theta}}Θˉ. We show that our estimation strategy is robust to model misspecification, contamination and the presence of outliers. Besides, we provide an algorithm for calculating θ^\hat{\boldsymbol{\theta}}θ^ when Θˉ\bar{\boldsymbol{\Theta}}Θˉ is a VC-class of functions of low or moderate dimension and we carry out a simulation study to compare the performance of θ^\hat{\boldsymbol{\theta}}θ^ to that of the MLE and median-based estimators.

View on arXiv
Comments on this paper