If Z1 and Z2 are two independent standard normal random variables, then the characteristic function of (Z1+Z2) is:
Exp(-t)
Exp(-2t)
Exp(-t/2)
None of the above
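A quick simulation sketch of the fact behind this question: for a standard normal Z the characteristic function is E[exp(itZ)] = exp(-t^2/2), and independence makes the characteristic function of a sum the product of the individual ones, so for Z1 + Z2 it is exp(-t^2). The Python check below is illustrative only (the sample size and t values are arbitrary choices):

import numpy as np

rng = np.random.default_rng(0)
z1 = rng.standard_normal(200_000)
z2 = rng.standard_normal(200_000)
s = z1 + z2  # sum of two independent standard normals

for t in (0.5, 1.0, 2.0):
    empirical = np.mean(np.exp(1j * t * s))      # sample analogue of E[exp(itS)]
    print(t, round(float(empirical.real), 3), round(float(np.exp(-t ** 2)), 3))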
The "Risk" associated with any decision rule is:
The expected loss, where the expectation is taken with respect to the uncertainty associated with the parameters
The risk of a scalar estimator is generally less than its variance
The risk of a vector estimator is the trace of its matrix mean squared error
The risk of a vector estimator is just its matrix mean squared error
If the loss function is quadratic, then:
The risk of a scalar estimator is just its variance
The risk of a scalar estimator is generally less than its variance
The risk of a vector estimator is the trace of its matrix mean squared error
The risk of a vector estimator is just its matrix mean squared error
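For reference, the standard decomposition behind these options, using squared-error loss in the scalar case and sum-of-squared-errors loss in the vector case:

R(\theta, \hat{\theta}) = E[(\hat{\theta} - \theta)^2] = \mathrm{Var}(\hat{\theta}) + [E(\hat{\theta}) - \theta]^2

R(\theta, \hat{\theta}) = E[(\hat{\theta} - \theta)'(\hat{\theta} - \theta)] = \mathrm{tr}\{ E[(\hat{\theta} - \theta)(\hat{\theta} - \theta)'] \} = \mathrm{tr}[\mathrm{MSE}(\hat{\theta})]

So the scalar risk equals the variance only when the estimator is unbiased.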
If an estimator is "inadmissible", then:
There is at least one other estimator whose loss is less than or equal to the loss of this estimator everywhere in the parameter space, and strictly less somewhere in the parameter space
There is at least one other estimator whose risk is less than or equal to the risk of this estimator everywhere in the parameter space, and strictly less somewhere in the parameter space
There is at least one other estimator whose risk is strictly less than the risk of this estimator everywhere in the parameter space
It cannot be weakly consistent
If an estimator is "Mini-Max", then:
It must be admissible
It cannot be admissible
It may be admissible or inadmissible
Its risk function must "cross" the risk function of at least one other estimator
If a scalar statistic is "sufficient", then:
It will be an admissible estimator of the population parameter
It will be an efficient estimator of the population parameter
It will be an unbiased estimator of the population parameter
It contains all of the sample information that is needed to estimate the population parameter
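A standard illustration of what "contains all of the sample information" means, using a Bernoulli sample purely as an example: for independent x_1, ..., x_n from a Bernoulli(p) distribution,

f(x_1, \ldots, x_n; p) = p^{\sum_i x_i} (1 - p)^{n - \sum_i x_i}

depends on the data only through \sum_i x_i, so by the factorization theorem \sum_i x_i is a sufficient statistic for p.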
The Newton-Raphson algorithm:
May yield multiple solutions, all of which will be local maxima, and one of which will be the global maximum
May yield multiple solutions, some of which relate to local maxima and some of which relate to local minima
Will always converge to a global extremum in a finite number of iterations
Will converge in 3 steps if the underlying function is a cubic polynomial
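A minimal sketch of the issue raised here, applying Newton-Raphson to the first-order condition of a function with several stationary points; the objective function, starting values, and tolerances are illustrative choices:

# Illustrative objective: g(x) = x**4 - 3*x**2 + x (two local minima and one local maximum)
g1 = lambda x: 4 * x**3 - 6 * x + 1   # first derivative (gradient)
g2 = lambda x: 12 * x**2 - 6          # second derivative (Hessian)

def newton_raphson(x, tol=1e-10, max_iter=100):
    # Newton-Raphson applied to the first-order condition g1(x) = 0
    for _ in range(max_iter):
        step = g1(x) / g2(x)
        x -= step
        if abs(step) < tol:
            break
    return x

for start in (-2.0, 0.0, 2.0):
    x = newton_raphson(start)
    kind = "maximum" if g2(x) < 0 else "minimum"
    print(f"start {start:+.1f} -> stationary point {x:+.4f} (local {kind})")

Depending on the starting value, the algorithm converges to a local maximum or to a local minimum of the underlying function.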
The "Invariance" property of MLE's implies that:
Their variance approaches zero as the sample size increases without limit
Their variance achieves the Cramer-Rao lower bound
Any monotonic function of an MLE is the MLE for that function of the parameter(s)
Any continuous function of an MLE is the MLE for that function of the parameter(s)
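The textbook illustration of the invariance property, used here only as an example: in a normal sample the MLE of \sigma^2 is \hat{\sigma}^2 = \tfrac{1}{n}\sum_i (x_i - \bar{x})^2, and the MLE of \sigma is obtained directly as

\hat{\sigma} = \sqrt{\hat{\sigma}^2} = \sqrt{\tfrac{1}{n}\sum_i (x_i - \bar{x})^2},

without re-maximizing the likelihood in terms of \sigma.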
If X follows a uniform distribution on [0 , 1], and Y = 5X, then:
The Jacobian for the mapping from X to Y is 0.2, and Y is uniform on [0 , 5]
The Jacobian for the mapping from X to Y is 5, and Y is uniform on [0 , 0.2]
The Jacobian for the mapping from X to Y is 0.2, and Y is uniform on [0 , 0.2]
The Jacobian for the mapping from X to Y is 5, and Y is uniform on [0 , 5]
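A worked version of the calculation behind this question, reading "the Jacobian" as the derivative dx/dy that appears in the change-of-variables formula (an assumption about the intended convention): with x = y/5,

f_Y(y) = f_X(y/5) \left| \frac{dx}{dy} \right| = 1 \times \frac{1}{5} = 0.2, \qquad 0 \le y \le 5,

so Y is uniform on [0 , 5].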
When we evaluate the Jacobian associated with a transformation from one probability distribution to another:
We use the absolute value because a density function cannot take negative values
We must be dealing with scalar random variables, not random vectors
The intention is to make sure that the support of the new random variable is the full real line
The intention is to make sure that the support of the new random variable is the positive half of the real line
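A small example of why the absolute value is needed, with the transformation chosen only for illustration: if X is uniform on [0 , 1] and Y = -2X (a decreasing transformation), then x = -y/2 and dx/dy = -1/2. Without the absolute value the "density" of Y would be negative; using |dx/dy| = 1/2 gives f_Y(y) = 1/2 for -2 \le y \le 0.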
If our random data are statistically independent, then:
The likelihood function is just the sum of the marginal data densities, viewed as a function of the parameter(s)
The log-likelihood function is just the product of the logarithms of the marginal data densities, viewed as a function of the parameter(s)
The log-likelihood function is just the sum of the logarithms of the marginal data densities, viewed as a function of the parameter(s)
The likelihood function will have a unique turning point, and this will be a maximum (not a minimum) if the sample size is large enough
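For reference, with independent observations x_1, ..., x_n drawn from the density f(x; \theta), the likelihood and log-likelihood are

L(\theta) = \prod_{i=1}^{n} f(x_i; \theta), \qquad \log L(\theta) = \sum_{i=1}^{n} \log f(x_i; \theta).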
The "Likelihood Equations" are:
The same as the "normal equations" associated with least squares estimation of the multiple linear regression model
Guaranteed to have a unique solution if the sample data are independent
Obtained by getting the second derivatives of the log-likelihood function with respect to each of the parameters, and setting these equal to zero
The first-order conditions that we have to solve in order to maximize the likelihood function
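A concrete instance, using an exponential sample with density f(x; \theta) = \theta e^{-\theta x} purely as an example: the log-likelihood is \log L(\theta) = n \log \theta - \theta \sum_i x_i, and the single likelihood equation is

\frac{\partial \log L(\theta)}{\partial \theta} = \frac{n}{\theta} - \sum_i x_i = 0 \;\Rightarrow\; \hat{\theta} = 1 / \bar{x}.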
When we "concentrate" the likelihood function, the objective is to:
Focus attention on just the important parameters by conditioning on the 'nuisance parameters' in the problem
Reduce the dimension of that part of the optimization problem that has to be solved numerically
Take a monotonic transformation of the likelihood function so that it is easier to find the global maximum
Convert what would be a non-linear optimization problem into one that is approximately linear
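The usual illustration, with the normal linear regression model used only as an example: from \log L(\beta, \sigma^2) = -\tfrac{n}{2}\log(2\pi\sigma^2) - \tfrac{1}{2\sigma^2}(y - X\beta)'(y - X\beta), maximizing over \sigma^2 for given \beta yields \hat{\sigma}^2(\beta) = (y - X\beta)'(y - X\beta)/n. Substituting this back gives the concentrated log-likelihood

\log L_c(\beta) = -\tfrac{n}{2}[\log(2\pi) + 1] - \tfrac{n}{2}\log[(y - X\beta)'(y - X\beta)/n],

which is a function of \beta alone, so any numerical search takes place in a lower-dimensional parameter space.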
Suppose that Y follows a Binomial distribution based on 'n' independent trials, each with probability 'p' of a 'success'. Then the MLE of the standard deviation of Y is:
The square root of np(1-p)
The square root of y(n-y)/n, where y is the observed number of 'successes' in the sample
The square root of n(y-n)/y, where y is the observed number of 'successes' in the sample
The square root of ny, where y is the observed number of 'successes' in the sample
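For reference, the calculation this question points to: the MLE of p is \hat{p} = y/n, and the standard deviation of Y is \sqrt{np(1-p)}, so by the invariance property the MLE of the standard deviation is

\sqrt{n \hat{p}(1 - \hat{p})} = \sqrt{n \cdot \tfrac{y}{n}\left(1 - \tfrac{y}{n}\right)} = \sqrt{y(n - y)/n}.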
The connection between a sufficient statistic and an MLE is:
A sufficient statistic is always an MLE
There is no connection in general
All MLEs are linear combinations of sufficient statistics
If an MLE is unique, then it must be a function of a sufficient statistic