Next Article in Journal
Information Flow in Times of Crisis: The Case of the European Banking and Sovereign Sectors
Previous Article in Journal
Acknowledgement to Reviewers of Econometrics in 2018
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Gini Regressions and Heteroskedasticity †

1
Centre de Recherche en Economie et Management (CREM), Université de Rennes, 35000 Rennes, France
2
Département D’économie, Université Alioune Diop de Bambey, Bambey BP 30, Senegal
3
Chrome, Université de Nîmes, 30000 Nîmes, France
*
Author to whom correspondence should be addressed.
The authors would like to thank their three reviewers. The usual disclaimer applies.
Econometrics 2019, 7(1), 4; https://doi.org/10.3390/econometrics7010004
Submission received: 20 July 2018 / Revised: 19 November 2018 / Accepted: 4 January 2019 / Published: 14 January 2019

Abstract

:
We propose an Aitken estimator for Gini regression. The suggested A -Gini estimator is proven to be a U-statistics. Monte Carlo simulations are provided to deal with heteroskedasticity and to make some comparisons between the generalized least squares and the Gini regression. A Gini-White test is proposed and shows that a better power is obtained compared with the usual White test when outlying observations contaminate the data.
JEL Classification:
C14; C3

1. Introduction

Among 1 regressions, the Gini regression initiated by Olkin and Yitzhaki (1992) is increasingly used in econometrics. It enables traditional hypotheses to be relaxed such as the linearity of the model. Moreover, it is well suited for the study of variables contaminated by outliers or measurement errors. The reader is referred to Yitzhaki and Schechtman (2013) for a complete overview of the Gini methodology.
Shelef and Schechtman (2011) and Carcea and Serfling (2015) investigated independently the use of the Gini autocovariance functions to estimate, respectively, the parameter of AR(1) and ARMA processes in the case of heavy tailed distributions such as Pareto processes. Recently, Mussard and Ndiaye (2018) investigated the semi-parametric Gini regression for vector autoregressive models in which non-spherical disturbances occur. They showed that premultiplying the model by a matrix that neutralizes the Gini covariance of the error terms may produce non-biased Gini estimators.
In the context of semi-parametric Gini regressions, we showed that the Aitken transformation (Aitken 1935) for non-spherical disturbances based on the variance provides exactly the same estimator obtained by neutralizing the Gini covariance of the error term. However, the convergence of the former estimator requires the existence of the second moment of the error term, whereas the latter is a U-statistics. Monte Carlo simulations are addressed in order to show the superiority of the Aitken-Gini estimator compared with the traditional GLS estimator in the presence heteroskedasticity. It is also shown that the usual White test to detect heteroskedasticity should be done in the Gini sense, that is, by testing the Gini covariance of the regressors instead of their variance. In this case, more power is obtained for small samples. Finally, a feasible generalized Gini regression is provided. It consists in estimating the residuals of the regression (with the semi-parametric Gini regression) and to plug those residuals in the model to purge the heteroskedasticity (with a Gini instrumental variable regression). Monte Carlo simulations prove the superiority of this algorithm in the presence of outlying observations compared with the usual White algorithm based on generalized least squares.
The remainder of this paper is outlined as follows. We begin in Section 2 with the two versions of the Aitken-Gini estimator based, respectively, on the variance and on the Gini covariance, before showing their equivalence. Section 3 is devoted to the convergence property. Section 4 presents Monte Carlo simulations. It is shown that the combination of nonspherical disturbances and outliers (measurement errors) imply a loss of efficiency, which is not so important in the Gini case compared with GLS. In addition, the power of White’s test is analyzed in the presence of outlying observations. Section 5 closes the article.

2. Aitken-Gini Estimators

It is common practice to deal with heteroskedasticity and autocorrelation with generalized least squares. However, if the data are contaminated by outliers, the loss of efficiency can drastically affect the coefficient estimates. In the following, outliers are assumed to be contaminated data, such as measurement errors, which lead to bad estimates. It is shown that the employ of the Aitken-Gini estimator may be preferred to GLS when the data are contaminated by outlying observations. The model is the following:
y = X β g + ε g ,
where y is the dependent variable ( n × 1 vector), X [ x i k ] is the matrix of the regressors (of size n × K with a first column of ones), β g is the K × 1 vector of parameters to be estimated, and ε g is the vector of perturbation terms (of size n × 1 ). Following Olkin and Yitzhaki (1992), the semi-parametric Gini regression yields an estimator of β g ,
β ^ g = ( R x X ) 1 R x y ,
where R x is the rank matrix of X . The rank matrix R x is the matrix in which, for each regressor x k ( k = 1 , , K ), the observations x i k ( i = 1 , , n ) are replaced by their rank within x k (the smallest value of x i k is replaced by 1, and the highest one by n). Olkin and Yitzhaki (1992) showed that Gini estimators may be of particular relevance when outliers arise in the data. It is worth mentioning that, in the sequel, only the semi-parametric Gini regression is investigated. The parametric Gini regression is a numerical technique relying on the minimization of the Gini index of the residuals, which yields the same estimator as the semi-parametric Gini regression when the model is linear.

2.1. Mimicking the Usual Aitken Estimator

The generalized least squares (GLS) technique requires, in the case of heteroskedasticity and non serial correlation, the traditional following hypotheses E ( ε g , i ) = 0 , cov ( ε g , i , ε g , j ) = 0 for all i j , and E ( ε g , i 2 ) = σ i 2 , such that E [ ε g ε g ] = σ 2 Ω with
Ω = a 1 0 0 0 0 a 2 0 0 0 0 0 a n , a i > 0 , i = 1 , , n .
Let us denote by Var ( ε g ) = E [ ε g ε g ] the variance of the error term such that Var ( ε g ) : = σ 2 Ω . Let P = Ω 1 2 , then setting y : = P y , X : = P X and ε g : = P ε g yields:1
P y = P X β g + P ε g y = X β g + ε g .
Thereby, a first Aitken-Gini estimator may be derived.
Proposition 1.
Let R x be the rank matrix of X , then applying the usual semi-parametric Gini regression in Equation (2) to the model in Equation (3) yields:
β ^ g = ( R x X ) 1 R x y such that E [ ε g ε g ] = σ 2 I n .
Proof. 
The application of the semi-parametric Gini regression to the model in Equation (3) is obvious. Note that:
E [ ε g ε g ] = P σ 2 Ω P = σ 2 I n .
 □
We obtain a result quite close to Equation (2), which has the form of estimators by instrumental variables (IV) (see Yitzhaki and Schechtman (2004) for the link between Gini regressions and IV). Indeed, setting Z : = R x P , we get an IV estimator:
β ^ g = ( R x X ) 1 R x y = ( R x P X ) 1 R x P y = ( Z X ) 1 Z y .
The Gini estimator in Equation (4) is derived by mimicking the usual Aitken estimator, which may be used if, and only if, the variability of the error term is defined with respect to the variance, i.e., E [ ε g ε g ] = σ 2 I n . However, the Gini methodology is employed whenever the underlying variability is the covariance-Gini defined by Schechtman and Yitzhaki (1987), the co-Gini from now on, which is examined in the next subsection.

2.2. The Aitken-Gini Estimator

The usual Aitken estimator described in the previous subsection is valid whenever the second moments of ε g are known and when no outliers occur in X , for which the first moments exist. The Gini estimator may be one solution to overcome this difficulty without invoking the existence of the second moment of ε g . For that purpose, we must define the transformed model,
P y = P X β a g + P ε g y = X β a g + ε g ,
such that there is no heteroskedasticity in the Gini sense, that is, the co-Gini of ε g , i remains constant for all i = 1 , , n . Let the co-Gini operator be defined such that:
cog ( ε g , i , ε g , i ) : = cov ( ε g , i , F ε ( ε g , i ) ) ,
where F ε ( ε g , i ) is the cumulative distribution function of ε g , i . In this respect, we have E [ ε g F ε ( ε g ) ] = g Ω G with g 0 such that,
Ω G = b 1 0 0 0 0 b 2 0 0 0 0 0 b n , b i > 0 , i = 1 , , n .
The Aitken-Gini estimator must be defined according to the P -rank idempotent property of the transformation matrix P .
Definition 1.
A squared matrix P is said to be P -rank idempotent if, for any given real random variable X, such that Z = P X ,
F x ( X ) = F z ( P X ) ,
where F x and F z stand for the cumulative distribution functions of X and Z, respectively.
For estimation purposes, this assumption implies that the rank vector of X remains invariant after any given transformation P . This assumption is necessary to obtain spherical disturbances.
Proposition 2.
If there exists a P -rank idempotent matrix such that F ε ( P ε g ) = F ε ( ε g ) then applying the usual semi-parametric Gini regression in Equation (2) to the model in Equation (5) yields the Aitken-Gini estimator:
β ^ a g = ( R x X ) 1 R x y such that E [ ε g F ε ( ε g ) ] = g I n .
Proof. 
If P is supposed to be P -rank idempotent, then F ε ( P ε g ) = F ε ( ε g ) . From the transformed model in Equation (5):
E [ ε g F ε ( ε g ) ] = E [ P ε g F ε ( P ε g ) ] = P E [ ε g F ε ( P ε g ) ] = g I n .
Since F ε ( P ε g ) = F ε ( ε g ) , then
P E [ ε g F ε ( ε g ) ] = P g Ω G = g I n P = [ Ω G ] 1 .
Note that the invertibility of Ω G is ensured since it is positive semi-definite. □

2.3. A Reconciliation

In the previous subsections, two Aitkien-Gini estimators have been derived. We can actually show that the Gini estimator β ^ g that mimics the GLS is equivalent to the Aitken-Gini estimator β ^ a g .
Proposition 3.
Let ξ i be an i.i.d. process such that Var ( ξ i ) = σ 2 and cog ( ξ i , ξ i ) = g > 0 . Let ε g i = ξ i h ( i ) such that i = 1 , , n for some real-valued function h : N + R + + and assume that ε g : = Ω 1 2 ε g and ε ˜ g : = [ Ω G ] 1 ε g . Then, the following assertions hold:
(i)
Var ( ε g , i ) = σ 2 .
(ii)
cog ( ε g , i , ε g , i ) = g .
(iii)
β ^ g = β ^ a g and ε ˜ g = ε g .
Proof. 
(i) Let us remark that Var ( ε g i ) = i i d Var ( ξ h ( i ) ) = σ 2 h ( i ) . Consequently,
σ 2 Ω = σ 2 h ( 1 ) 0 0 0 h ( i ) h ( n ) .
Thus, by Proposition 1 we get that P = Ω 1 2 , consequently the transformed model provides ε g = P ε g = Ω 1 2 ε g = ( ξ 1 , , ξ n ) . Hence, Var ( ε g , i ) = Var ( ξ i ) = σ 2 .
(ii) We have cog ( ε g , i , ε g , i ) = i i d cog ( ξ h ( i ) , F ( ξ h ( i ) ) ) = h ( i ) cog ( ξ , F ( ξ ) ) . Thereby, cog ( ε g , i , ε g , i ) = g h ( i ) and so:
g Ω G = g h ( 1 ) 0 0 0 h ( i ) h ( n ) .
By Proposition 2, we get that P = [ Ω G ] 1 . The transformed model yields ε ˜ g = P ε g = [ Ω G ] 1 ε g = ( ξ 1 , , ξ n ) . Hence, cog ( ε ˜ g , i , ε ˜ g , i ) = cog ( ξ i , ξ i ) = g .
(iii) Considering Assertions (i) and (ii), it follows that ε g = ε ˜ g . Note that both estimators β ^ g and β ^ a g are issued from ( R x X ) 1 R x y with X = P X , y = P y and R x the rank matrix of X . Note that P = Ω 1 2 is employed in the first case and P = [ Ω G ] 1 in the second one. Since Ω 1 2 = [ Ω G ] 1 , then β ^ g = β ^ a g , which concludes the proof. □
The previous results are all based on the rank matrix of X ; consequently, as shown by Olkin and Yitzhaki (1992), the semi-parametric Gini regression is robust to outliers. Although the previous proposition indicates that an equivalence exists between the two Aitken-Gini estimators β ^ g and β ^ a g ; it is noteworthy that β ^ a g requires fewer assumptions, since the first moment of ε g has to be known only, whereas β ^ g is based on the existence of the two first moments of ε g .

3. Sampling Properties

The aim of this section is to show, as above, that two strategies are available to get the sampling variance of the Aitken-Gini estimator. The first one is to consider that the second moment of ε g , i exists, as usual in the case of least squares regression, to derive the asymptotic variance of β ^ g . The second one assumes that the second moment of ε g , i does not exist, thus the variance of β ^ a g is derived by jackknife. For this purpose, one needs additional assumptions.
We start again with the transformed model:
P y = P X β a g + P ε g y = X β a g + ε g .
The hypotheses of the model are the following:
H1: E [ ε g ] = 0 .
H2: Whenever P is a non null matrix, the perturbation term ε g is linearly approximated as follows:2
ε g = y X β a g .
H3: The perturbation term ε g is independent of R x .
H4: R x X is a positive definite matrix.
H5: The matrix Ω G is diagonal and it contains finite elements ω i > 0 .
H6: The second moment of ε g , i exists for all i = 1 , , n such that E [ ( ε g ) 2 ] = σ 2 I n .
Hypothesis H2 is necessary because the semi-parametric Gini regression does not rely on the usual linearity assumption of the regressors. First, we prove that the estimators β ^ g and β ^ a g are unbiased. The proof is made for β ^ a g only, since it is similar in both cases.
Proposition 4.
Under HypothesesH1H5, β ^ a g is an unbiased estimator of β a g .
Proof. 
From Equation (6), β ^ a g = ( R x X ) 1 R x y and X = P X = [ Ω G ] 1 X , with R x X being invertible by Hypothesis H4, and with Ω being invertible by Hypothesis H 5 . Let Z : = R x [ Ω G ] 1 , then we have by Hypothesis H2:
β ^ a g = ( Z X ) 1 Z y = ( Z X ) 1 Z ( X β a g + ε g ) = β a g + ( Z X ) 1 Z ε g .
Since Z ε g = R x [ Ω G ] 1 ε g = R x ε g , therefore, by Hypotheses H1 and H3:
E [ β ^ a g ] = E [ β a g + ( Z X ) 1 R x ε g ] = β a g .
Note that Hypothesis H3 is respected whenever outliers do not contaminate the sample. □

3.1. Convergence

We suppose that the second moments of ε g , i exists [Hypothesis H6] in order to derive the asymptotic variance of β ^ g and to check for its convergence. By the result of Proposition 3, we set Z : = R x [ Ω ] 1 2 = R x [ Ω G ] 1 .
Proposition 5.
Under HypothesesH1H6, the following assertions hold.
(i)
Var ( β ^ g | X ) = σ 2 ( R x [ Ω G ] 1 X ) 1 R x R x [ ( R x [ Ω G ] 1 X ) 1 ] .
(ii)
β ^ g is convergent.
Proof. 
(i) From Equation (7), we deduce that:
Var ( β ^ g | X ) = E ( β ^ g β g ) ( β ^ g β g ) = E ( Z X ) 1 R x ε g ε g R x [ ( Z X ) 1 ] = σ 2 ( R x [ Ω G ] 1 X ) 1 R x R x [ ( R x [ Ω G ] 1 X ) 1 ] .
(ii) From Proposition 4, β ^ g is an unbiased estimate of β g . We have Z X = R x [ Ω G ] 1 X = R x X . The matrix Z X exists since Ω G is invertible by Hypothesis H5. Thus by Hypothesis H4,
plim 1 n Z X = plim 1 n R x [ Ω G ] 1 X = plim 1 n i = 1 n ω i 1 r i x i
is a positive definite matrix (with r i and x i being rows of R x and X , respectively). Then, the asymptotic variance covariance matrix exists and amounts to:
a s . Var ( β ^ g | X ) = σ 2 n plim 1 n Z X 1 1 n R x R x plim 1 n ( Z X ) 1 .
Letting n tend towards infinity, we get that lim n + a s . Var ( β ^ g | X ) = 0 . □
As mentioned by Yitzhaki and Schechtman (2013), the inference on the regressors of the semi-parametric Gini regression has to be performed with U-statistics. In this case, the convergence is ensured without invoking Hypothesis H6.

3.2. Convergence with U-Statistics

As shown by Yitzhaki and Schechtman (2013), Gini estimators are U-statistics. The main advantage of dealing with the class of U-statistics, based on the generalized notion of average, is to find unbiased estimators and to derive their asymptotic property. The reader is referred to Serfling (1980, chp. 5) for more details. A brief review of this chapter is provided below.
Let X 1 , X 2 , , X m be independent observations from a population on a distribution F. The parameter ϑ ( F ) of the population is a parametric function for which an unbiased estimator exists. It is expressed as:
ϑ ( F ) = E [ ϕ ( X 1 , X 2 , , X m ) ] = ϕ ( x 1 , , x m ) d F ( x 1 ) d F ( x m ) ,
where ϕ ( x 1 , x 2 , , x m ) , called the kernel, is a symmetric function3. The U-statistics of ϑ ( F ) is an estimator based on a sample of size n, X 1 , , X n , such that n m . Averaging the kernel ϕ , the U-statistics is written as:
U n : = U ( X 1 , X 2 , , X n ) = m n 1 c ϕ ( X i 1 , X i 2 , , X i m ) ,
where c denotes the sum over all combinations of m elements { i 1 , , i m } from { 1 , , n } . The U-statistic for the parameter ϑ is an unbiased estimator of ϑ and the distribution of n ( U ϑ ) tends to a normal distribution as n under the condition that E [ ϕ 2 ( X 1 , X 2 , , X m ) ] < . The variance of a U-statistic also relies on the existence of second moments. Let the sets A : = { a 1 , , a m } and B : = { b 1 , , b m } be composed of m distinct integers among the set { 1 , , n } , with c the number of common integers of sets A and B. Let ϕ ˜ : = ϕ ϑ , then, by symmetry of ϕ ˜ as well as the independence of the observations X 1 , , X n of the sample:
ξ c : = E { ϕ ˜ ( X a 1 , , X a m ) ϕ ˜ ( X b 1 , , X b m ) }
Defining
U n ϑ = m n 1 c ϕ ˜ ( X i 1 , X i 2 , , X i m ) ,
the variance of a U-statistic is given by:
Var ( U n ) = E { ( U n ϑ ) 2 } = n m 2 c = 0 m n m m c n m m c ξ c ,
with n m m c n m m c the number of possibilities for sets A and B to get c elements in common, and with the hypothesis E { ϕ 2 ( X 1 , , X m ) } < . The estimation by jackknife of the variance of U n does not necessitate such an assumption:
Var ( U n ) = n 1 n i = 1 n U i 1 n i = 1 n U i 2 ,
where U i is the estimator based on a sample of size n 1 , without the ith observation.
Proposition 6.
Each element β ^ a g , k of the Aitken-Gini estimator β ^ a g = ( β ^ a g , 1 , , β ^ a g , K ) is a function of U-statistics, thus estimating the variance of β ^ a g , k by jackknife for all k = 1 , , K implies that β ^ a g , k is a consistent estimator of β a g , k such that β ^ a g , k a N for all k = 1 , , K , neither invoking the existence of the second moments of X nor those of ε g [H6].
Proof. 
See Appendix A. □

4. Tests and Simulations

In this section, it is shown that the semi-parametric Aitken-Gini estimator β ^ a g is more robust than the usual GLS one when the data are contaminated by outliers with Ω being known. Furthermore, a feasible generalized Gini regression is proposed to deal with the case where Ω is unknown.

4.1. Monte Carlo Simulations

We performed Monte Carlo simulations to assess the robustness of the semi-parametric Gini regression with outlying observations in X and the presence of heteroskedasticity. In this section, we assume that the heteroskedasticity shape is known. The steps of the Monte Carlo simulation were as follows.
Step 1: Generate three independent normal distributions x j N ( 0 , 1 ) of size n = 100 such that j = 2 , 3 , 4 , with x 1 = ( 1 , , 1 ) and X = ( x 1 , x 2 , x 3 , x 4 ) .
Step 2: Sort the matrix X by ascending order according to the vector x 2 (the second column of X ). Multiply the last row of X by θ = 100 , , 10 , 000 (with increments of 100) in order to inflate the most important value of x 2 : X n o : = θ X n .
Step 3: For each outlier θ , perform B = 1000 simulations, i.e., generate B × 3 independent normal distributions x j N ( 0 , 1 ) for all j = 2 , 3 , 4 with one outlier valued to be X n o : = θ X n .
Step 4: Generate heteroskedasticity as follows Ω G = d i a g ( 100 i ) and fix a vector β a g = ( 10 , 3 , 10 , 58 ) to compute y = X β a g + ε ˜ g with ε g , i N ( 0 , 1 ) independent of x j and with ε ˜ g = [ Ω G ] 1 ε g .
Step 5: Regress y on X o with the semi-parametric Gini regression and with GLS in order to estimate β a g . Compute the standard deviation of β ^ a g by jackknife in the first case, and the standard deviation of the GLS estimators in the second case (for each value of θ ). Measure the mean squared error of the coefficient estimates β ^ a g over B replications (for each θ ) for both techniques: Gini and GLS.
The jackknife standard deviations of the estimators β ^ a g , k for k = 1 , , 4 are reported in Figure 1 for each value of the outlier θ . As depicted in Figure 1, jackknife standard deviations of the Aitken-Gini estimator are lower than those of the GLS estimator, which are drastically affected by the introduction of one outlier in a sample of n = 100 observations. This corresponds to a contamination of the sample of only 1 % . Since the outlying observation corresponds to the last row of X ( X n o = θ X n ), in which there is the most important value of x 2 , the vector x 2 is the most contaminated regressor. Therefore, as shown in Figure 1 (top right), important variations of the standard deviation of β ^ a g , 2 are recorded, especially for the GLS estimator (red curve) compared with the Gini one (black curve).
In addition, it is possible to compute the mean squared errors of the GLS estimator and the Aitken-Gini one. The contamination process is the same as before.
The Aitken-Gini estimator is better than the usual GLS estimator for the constant of the model (Figure 2, top left) and for the second regressor x 2 (Figure 2, top right), as depicted in Figure 2. This is due to the fact that the outlier is generated by multiplying X n by θ , which corresponds to inflating the most important value of x 2 (Step 2). Consequently, the Aitken-Gini estimator β ^ a g , 2 yields a robust estimation compared with GLS. For the other cases, the MSE of the generalized least squares estimator are less important.

4.2. Tests

The aim of this subsection is to prove that the usual White test for heteroskedasticity has a low power whenever outlying observations arise in the sample, even if the contamination rate is around 1%. Another test is proposed based on the co-Gini operator, and it is shown that a good power may be obtained compared with the standard White test.
Although GLS estimators may be affected by outliers, it is worth mentioning that Aitken-Gini estimators and GLS estimators are based on two different notions of heteroskedasticity. The Aitken-Gini one captures another type of variability, the co-Gini based on ranks, compared with GLS based on the variance. In the following, focus is put on White’s test since it is commonly employed in the literature.
White’s model and its Gini counterpart are given by,
y = X β + ε ε ^ i 2 = δ 0 + k = 1 K δ k x i k + k = 1 K γ k x i k 2 + u i
and,
y = X β g + ε g ε ^ g , i r ε g , i y ˜ i = δ 0 + k = 1 K δ k x i k + k = 1 K γ k x i k r i k + u g , i ,
where r ε g , i is the rank of ε g , i and r i k is the rank of x i k (within the vector x k ). The intuition of the White-Gini test is to exhibit the variables x k that depend on the rank of the individuals. This is the case for example when we regress incomes on age. We have the same intuition for White’s test performed with OLS. However, the squared residuals and the squared covariates may be inflated because of the outliers. In this respect, it is possible to use Eq.(White-Gini) to test for heteroskedasticity. It is noteworthy that this equation cannot be estimated by the semi-parametric Gini regression since the rank vector of x k and the rank vector of x k r k are collinear (⊗ being the Hadamard product). Consequently, both equations are estimated by OLS. The advantage of dealing with Eq.(White-Gini) is to capture the shape of heteroskedasticity in the presence of outliers. The standard White-OLS equation aims at capturing quadratic shapes in the covariates. However, the model fails to achieve this goal in the presence of outliers because outliers are squared. In the White-Gini equation, the product x i k r i k allows the quadratic shape to be detected, while the intensity of the outliers are attenuated by the role of the rank vector r k .
In the following tables, we provide the mean R 2 of each model over the number of Monte Carlo experiments B = 500 , 1000 , 5000 . We provide in parenthesis the power of the Fisher test related to the significance of the R 2 in each model. The Monte Carlo simulations with contamination were based on the same simulation process described in Algorithm 1.
In Table 1, one observation is contaminated for a sample size n = 30 , that is 3.33% of the sample. The same generating process was used as in the Monte Carlo simulations performed in the previous section. The outlying observation consists in multiplying the most important value of x 2 by θ . Without outlier, the White-Gini model provides an R 2 of 0.17 (in mean over B) with a very low test power around 2%, whereas the White-OLS model yields an R 2 of 0.26 with a power of 14%. However, when θ is valued to be 100, the White-Gini model performs quite well with an R 2 of 0.45 and a power of 51%, whereas the power decreases slightly in the White-OLS model. In the White-Gini model, thanks to the rank vector of x as a regressor, the regression curve stays distant from the outlying observation and becomes closer to the other points. Then, the variability of the model explained by the regression curve increases (and then R 2 increases). On the contrary, for the standard model (White-OLS), the regression curve moves toward the outlying observation so that the variability of the residuals increases, and in this case R 2 decreases.
In Table 2, the sample size is n = 100 so that the contamination represents 1% of the sample. Without outlier, the White-Gini model provides a very low test power around 7%, compared with 64–70% for White-OLS model. The test power increases to reach 70% in the first case against 59% in the second case.
Finally, in Table 3, R 2 and test power remain quite equivalent in both models. As mentioned in the literature, for large samples, White-OLS provides an excellent power. When the outliers are dilute in the sample, for instance when the contamination of the sample is only concerned with 0 , 1 % of the sample, because the sample size is large and the number of outliers very low, both tests produce the same power.
As shown in Table 1, Table 2 and Table 3, when outlying observations affect the sample, the power of the White-Gini test is higher than that of the usual White test.

4.3. The Feasible Generalized Gini Regression

After testing the presence of heteroskedasticity with outlying observations, a new procedure is proposed to estimate Ω . The so-called feasible generalized least squares (FGLS), introduced by Zellner (1962), was adapted to the Gini regression, the feasible generalized Gini regression (FGGR). Aitken’s theorem no longer applies if Ω is unknown and must be estimated. The feasible generalized least squares estimator is not the best linear unbiased estimator, nevertheless Kakwani (1967) proved that it is still unbiased under general conditions, and Schmidt (1976) discussed the fact that most of the properties of generalized least squares estimation remain intact in large samples, when plugging in an estimator of Ω . The form of the heteroskedasticity is unknown, but it can be approximated with a flexible model as,
ln ( e 2 ) = X b g + u ,
in a “Breusch–Pagan” version (in the sense that we consider a linear form of heteroskedasticity), such that E ( u ) = 1 and u independent of X . Instead of using the least squared estimator, the semi-parametric Gini estimator in Equation (2) is employed to deal with contaminated data in X :
b ^ g = ( R x X ) 1 R x ln ( e 2 ) .
Then,
h ^ i : = exp ( ln ( e ^ i 2 ) ) = exp ( x i b g ^ ) ,
such that,
Ω ^ : = h ^ 1 0 0 0 h ^ 2 0 0 0 h ^ n .
From Ω ^ , we deduce an estimation of P denoted P ^ = Ω ^ 1 2 . Thus, we get that X ^ : = Ω ^ 1 2 X . Let R x ^ be the rank matrix of X ^ , hence the FGGR estimator is given by:
β ^ F G G R = ( R x ^ X ^ ) 1 R x ^ y .
On the contrary, the usual FGLS estimator is given by,
β ^ F G L S = ( X ^ X ^ ) 1 X ^ y ,
with b g estimated by generalized least squares in the first step [Equation (9)]. However, in Proposition 2, it is shown that the Aitken-Gini estimator is based on the P -rank idempotent hypothesis. This assumption states that the rank vector of the residuals must remain invariant after the transformation of the model with respect to matrix P , thereby an Aitken estimator is obtained. However, whenever one outlier occurs in the sample, e.g., the ith row of X is such that X i o ± ), then it comes that ε g , i ± so that the respect of the P -rank idempotent hypothesis is not necessarily ensured for the outlying observation. Consequently, the FGGR estimator is biased. Indeed, the semi-parametric Gini estimator in Equation (10) is based on the rank matrix R x ^ of P X which contains errors since P is computed on the basis of contaminated data. Replacing R x ^ by R x , being the rank matrix of X , avoids such a contamination. It is worth mentioning that replacing the rank matrix of the covariates by another rank matrix of some data correlated to the covariates corresponds to the Gini instrumental variable estimator introduced by Yitzhaki and Schechtman (2004). In this respect, the FGGR estimator becomes a feasible generalized Gini regression by instrumental variable (FGGR-IV):
β ^ F G G R I V = ( R x X ^ ) 1 R x y .
Because R x is the rank matrix of the initial contaminated data, it comes that the residuals of the transformed model issued from the FGGR-IV estimator are more likely to respect the P -rank idempotent hypothesis compared with FGGR based on P X because both matrices are contaminated.
We performed some Monte Carlo simulations to compare the mean squared errors of the FGGR-IV and FGLS estimators.4
Step 1: Generate three independent normal distributions x j N ( 0 , 1 ) of size n = 100 for all j = 2 , 3 , 4 with x 1 = ( 1 , , 1 ) .
Step 2: As in the previous simulation, sort the matrix X = ( x 1 , , x 4 ) by ascending order according to x 2 , except multiply the last row of X by θ = 1 , , 100 (with increments of 1 to avoid problems of matrix invertibility) to inflate the most important value of x 2 .
Step 3: For each outlier θ , perform B = 1000 simulations, i.e., generate B × 3 independent normal distributions x j N ( 0 , 1 ) for all j = 2 , 3 , 4 with one outlier valued to be X n o : = θ X n .
Step 4: Fix a vector β a g = ( 10 , 3 , 10 , 58 ) to compute y = β a g , 1 x 1 + β a g , 2 x 2 + β a g , 3 x 3 x 3 + β a g , 4 x 4 + ε ˜ g with ε g , i N ( 0 , 1 ) , that is, suppose that the heteroskedasticity comes from x 3 .
Step 5: Compute the coefficients estimated based on FGGR-IV and FGLS with their MSEs over B = 1000 replications.
Figure 3 depicts an interesting correction of heteroskedasticity performed by the FGGR-IV estimator when outliers contaminate only 1% of the sample. The FGGR-IV estimator provides MSEs close to 0 except for the constant and for β ^ a g , 3 . Because the model has been specified such that β a g , 3 x 3 x 3 , then the outlier is even more inflated in this case (bottom left in Figure 3).
To show that the FGGR-IV estimator is relevant with other forms of heteroskedasticity, Step 4 was replaced by Step 4’, in which an ARCH(1) is modeled (see Figure 4):
Step 4’: Fix a vector β a g = ( 10 , 3 , 10 , 58 ) to compute y = β a g , 1 x 1 + β a g , 2 x 2 + β a g , 3 x 3 + β a g , 4 x 4 + ε ˜ g 1 + 2 ϵ 2 with ε g , i N ( 0 , 1 ) and ϵ i N ( 0 , 1 ) .
The results depicted in Figure 4 are even more clear: the MSEs of all FGGR-IV estimators tend toward 0, consequently the bias of those estimators also tend toward 0.

5. Concluding Remarks

In this paper, we have demonstrated that two equivalent Gini estimators may be proposed to deal with heteroskedasticity: the former deals with heteroskedasticity in the variance sense and the latter with heteroskedasticity in the Gini sense. The jackknife variance of these estimators are shown to be robust in the presence of outlying observations compared with the usual GLS technique, i.e., the loss of efficiency is less important in the Gini case. The simulations presented in Table 1, Table 2 and Table 3 show that a contamination of 1% of the sample may drastically affect the power of the White-OLS test, so that the White-Gini test may be preferred to detect the presence of heteroskedasticity when outlying observations occur in the sample.

Author Contributions

Conceptualization, S.M. and N.K.; methodology, A.C.; software, A.C. and S.M.; validation, O.H.N.; writing–original draft preparation, all authors; writing–review and editing, all authors.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof of Proposition 6: 
We follow the proof obtained by Ka and Mussard (2016) in the case of fixed effects panel Gini regressions. Let r k be the kth column of R x and x k the kth column of X , for all k = 1 , , K . Let β ^ a g = : ( β ^ a g 1 , , β ^ a g K ) . From Equation (3), we get that:
y = β ^ a g 1 x 1 + + β ^ a g K x K + ε g .
Hence, the following identities hold:5
cov ( y , r 1 ) = β ^ a g 1 cov ( x 1 , r 1 ) + + β ^ a g K cov ( x K , r 1 ) + cov ( ε g , r 1 ) cov ( y , r k ) = β ^ a g 1 cov ( x 1 , r k ) + + β ^ a g K cov ( x K , r k ) + cov ( ε g , r k ) cov ( y , r K ) = β ^ a g 1 cov ( x 1 , r K ) + + β ^ a g K cov ( x K , r K ) + cov ( ε g , r K ) .
Setting β ^ ε j : = cov ( ε g , r j ) cov ( x j , r j ) , β ^ 0 j : = cov ( y , r j ) cov ( x j , r j ) and β ^ k j : = cov ( x k , r j ) cov ( x j , r j ) , and dividing the three last equations by, respectively, cov ( x 1 , r 1 ) , cov ( x k , r k ) and cov ( x K , r K ) yields:
β ^ 01 = β ^ a g 1 + + β ^ a g K β ^ K 1 + β ^ ε 1 β ^ 0 k = β ^ a g 1 β ^ 1 k + + β ^ a g K β ^ K k + β ^ ε k β ^ 0 K = β ^ a g 1 β ^ 1 K + + β ^ a g K + β ^ ε K .
Now, we define the two following columns vectors b ^ 0 : = ( β ^ 01 , , β ^ 0 K ) and b ^ ε : = ( β ^ ε 1 , , β ^ ε K ) . Then, it comes:
β ^ a g = 1 β ^ 21 β ^ K 1 β ^ 1 K β ^ 2 K 1 1 β ^ 01 β ^ ε 1 β ^ 0 K β ^ ε K = : B ^ 1 b ^ 0 b ^ ε
The previous expression shows that the Aitken-Gini estimator β ^ a g is a function of slope coefficients of semi-parametric simple Gini regressions β ^ . . . Consequently, it is a semi-parametric Gini estimator.
Note that β ^ ε j , β ^ 0 j and β ^ k j are all ratios of U-statistics. Indeed, consider ( X , Y ) a continuous bivariate distribution with F and G the marginal cumulative distribution functions of X and Y, respectively. By Proposition 9.2 of Yitzhaki and Schechtman (2013), there exists an unbiased and consistent estimator U a of 4 cov ( Y , F ( X ) ) :
U a = 4 n m 1 i = 1 n ( 2 i 1 n ) y x ( i ) ,
where y x ( i ) is the value of y concomitant to the ith order statistic of x 1 , , x n . On the other hand, the U-statistic of 4 cov ( X , F ( X ) ) is (Proposition 9.1 of Yitzhaki and Schechtman 2013):
U b = 4 n m 1 i = 1 n ( 2 i 1 n ) x ( i ) .
Estimators U a and U b are unbiased and consistent estimators of 4 cov ( X , G ( Y ) ) and 4 cov ( X , F ( X ) ) , respectively.6 The estimators β ^ ε j , β ^ 0 j and β ^ k j are ratios of two dependent U-statistics, such as U l : = U a / U b . By Slutzky’s theorem, because U a and U b are consistent estimators, then U l is also a consistent estimator. By Theorem 10.4 in Yitzhaki and Schechtman (2013), if there exists a real-valued function g ( θ 1 , , θ l ) of parameters θ i of the population, and if there exist U-statistics U = ( U 1 , , U l ) corresponding to θ = ( θ 1 , , θ l ) such that g and its derivatives are continuous in the neighbourhood of ( θ 1 , , θ l ) , then n ( g ( U ) g ( θ ) ) a N . Because the estimation of the variance of g ( U 1 , , U l ) can be made by jackknife, there is no need to postulate the existence of the second moments E { ϕ 2 ( X 1 , , X m ) } . From Equation (A1), since each element β ^ a g k of β ^ a g is a function of ratios of U-statistics being consistent, then applying Theorem 10.4 in Yitzhaki and Schechtman (2013), it comes that β ^ a g k is a consistent estimator of β a g k such that β ^ a g k a N , for all k = 1 , , K . □

References

  1. Aitken, Alexander Craig. 1935. On least squares and combinations of observations. Proceedings of the Royal Society of Edinburgh 55: 42–48. [Google Scholar] [CrossRef]
  2. Carcea, Marcel, and Robert Serfling. 2015. A Gini autocovariance function for time series modeling. Journal of Time Series Analysis 36: 817–38. [Google Scholar] [CrossRef]
  3. Ka, Ndéné, and Stéphane Mussard. 2016. 1 Regressions: Gini estimators for fixed effects panel data. Journal of Applied Statistics 43: 1436–46. [Google Scholar] [CrossRef]
  4. Kakwani, Nanak. 1967. The unbiasedness of Zellner’s seemingly unrelated regression equation estimators. Journal of the American Statistical Association 82: 141–42. [Google Scholar] [CrossRef]
  5. Mussard, Stéphane, and Oumar Hamady Ndiaye. 2018. Vector autoregressive models: A Gini approach. Physica A 492: 1967–79. [Google Scholar] [CrossRef]
  6. Olkin, Ingram, and Shlomo Yitzhaki. 1992. Gini regression analysis. International Statistical Review 60: 185–96. [Google Scholar] [CrossRef]
  7. Schechtman, Edna, and Shlomo Yitzhaki. 1987. A Measure of association based on Gin’s mean difference. Communications in Statistics—Theory and Methods 16: 207–31. [Google Scholar] [CrossRef]
  8. Schmidt, Peter. 1976. Econometrics. New York: Marcel Dekker. [Google Scholar]
  9. Serfling, Robert J. 1980. Approximation Theorems of Mathematical Statistics. New York: John Wiley & Sons. [Google Scholar]
  10. Shelef, Amit, and Edna Schechtman. 2011. A Gini-based methodology for identifying and analyzing time series with non-normal innovations. SSNR Electronic Journal, 1–26. [Google Scholar] [CrossRef]
  11. Yitzhaki, Shlomo, and Edna Schechtman. 2004. The Gini Instrumental Variable or the “double instrumental variable” estimator. Metron LXII: 287–313. [Google Scholar]
  12. Yitzhaki, Shlomo, and Edna Schechtman. 2013. The Gini Methodology. A Primer on a Statistical Methodology. New York: Springer. [Google Scholar]
  13. Zellner, Arnold. 1962. An efficient method of estimating seemingly unrelated regressions and tests for aggregation bias. Journal of the American Statistical Association 57: 348–68. [Google Scholar] [CrossRef]
1
This technique corresponds to the weighted least squares. Mathematically, things can be extended to the case where Ω is not diagonal using the singular value decomposition, but interpretation is much harder. Only the diagonal case is studied here.
2
The regression curve of the Gini regression does not require any linear assumption of the model. Only a linear approximation is necessary to estimate the error term.
3
If ϕ is not symmetric in its arguments, we can also average over the m ! permutations
ϕ ( X 1 , X 2 , , X m ) = ( m ! ) 1 p ϕ ( x i 1 , x i 2 , , x i m ) ,
with p the sum over all permutations of ( 1 , , m ) .
4
The results of FGGR are not reported because of their bad results compared with FGLS.
5
This technique was introduced by Yitzhaki and Schechtman (2013, chp. 8) in the case of the standard Gini regression.
6
Note that, for small samples, both U a and U b are biased downward.
Figure 1. Standard deviations of the coefficients.
Figure 1. Standard deviations of the coefficients.
Econometrics 07 00004 g001
Figure 2. Mean squared errors of the coefficients.
Figure 2. Mean squared errors of the coefficients.
Econometrics 07 00004 g002
Figure 3. Mean squared errors of the coefficients: FGGR-IV and FGLS.
Figure 3. Mean squared errors of the coefficients: FGGR-IV and FGLS.
Econometrics 07 00004 g003
Figure 4. Mean squared errors of the coefficients: FGGR-IV and FGLS with ARCH(1).
Figure 4. Mean squared errors of the coefficients: FGGR-IV and FGLS with ARCH(1).
Econometrics 07 00004 g004
Table 1. Power of the White-Gini test: small sample n = 30 .
Table 1. Power of the White-Gini test: small sample n = 30 .
Without Outliers: θ = 1 With Outliers: θ = 100
B = R 2 ¯ (White-Gini) R 2 ¯ (White-OLS) R 2 ¯ (White-Gini) R 2 ¯ (White-OLS)
5000.17 (0.02)0.25 (0.11)0.45 (0.51)0.24 (0.09)
10000.17 (0.03)0.26 (0.14)0.45 (0.51)0.24 (0.09)
50000.17 (0.02)0.26 (0.14)0.45 (0.50)0.24 (0.09)
Table 1: Contamination 3.33% of the sample; (), power of the test.
Table 2. Power of the White-Gini test: n = 100 .
Table 2. Power of the White-Gini test: n = 100 .
Without Outliers: θ = 1 With Outliers: θ = 100
B = R 2 ¯ (White-Gini) R 2 ¯ (White-OLS) R 2 ¯ (White-Gini) R 2 ¯ (White-OLS)
5000.08 (0.07)0.15 (0.64)0.32 (0.69)0.14 (0.59)
10000.08 (0.07)0.15 (0.7)0.32 (0.7)0.14 (0.59)
50000.08 (0.08)0.20 (0.65)0.32 (0.7)0.14 (0.58)
Table 2: Contamination 1% of the sample; (), power of the test.
Table 3. Power of the White-Gini test: n = 1000 .
Table 3. Power of the White-Gini test: n = 1000 .
Without Outliers: θ = 1 With Outliers: θ = 100
B = R 2 ¯ (White-Gini) R 2 ¯ (White-OLS) R 2 ¯ (White-Gini) R 2 ¯ (White-OLS)
5000.05 (1)0.11 (1)0.16 (1)0.11 (1)
1,0000.05 (1)0.11 (1)0.16 (1)0.11 (1)
5,0000.05 (1)0.11 (1)0.15 (1)0.11 (1)
Table 3: Contamination 0.1% of the sample; (), power of the test.

Share and Cite

MDPI and ACS Style

Charpentier, A.; Ka, N.; Mussard, S.; Ndiaye, O.H. Gini Regressions and Heteroskedasticity. Econometrics 2019, 7, 4. https://doi.org/10.3390/econometrics7010004

AMA Style

Charpentier A, Ka N, Mussard S, Ndiaye OH. Gini Regressions and Heteroskedasticity. Econometrics. 2019; 7(1):4. https://doi.org/10.3390/econometrics7010004

Chicago/Turabian Style

Charpentier, Arthur, Ndéné Ka, Stéphane Mussard, and Oumar Hamady Ndiaye. 2019. "Gini Regressions and Heteroskedasticity" Econometrics 7, no. 1: 4. https://doi.org/10.3390/econometrics7010004

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop