Xycoon logo
Econometrics
Home    Site Map    Site Search    Xycoon College    Free Online Software    
horizontal divider
vertical whitespace

Introduction to Econometrics - Definitions and Properties of RandomVariables

[Home] [Up] [Probability] [Axiom System] [Bayes Theorem] [Matrix Algebra] [Distribution Theory] [Estimator Properties] [Random Variables]


Many statistical models may be represented by

Introduction to Econometrics - Definitions and Properties of RandomVariables

(I.III-1)

with the three elements

(I.III-2)

Sometimes it is possible to describe the probability measure in (I.III-2) by the probability density function (pdf)

(I.III-3)

If a sample of observations is independent, identically distributed (iid) then the pdf can be written as

(I.III-4)

In Bayesian statistics an additional element is available. A prior pdf

(I.III-5)

is introduced into the model

(I.III-6)

The prior pdf represents prior information about the possible parameter values (without using the observations).

(I.III-7)

(I.III-8)

The aim of a lot of statistical research is to provide adequate theories and procedures in order to be able to

A variable is called a random (c.q. stochastic) variable if the possible values of the variable have different probabilities. Therefore a random variable always has a probability density function (pdf) and a probability distribution.

The relationship between distributions and probabilities can be defined as

(I.III-9)

where f(X) is the probability density function.

A cumulative probability distribution is defined as follows

(I.III-10)

for discrete distributions, or as

(I.III-11)

for continuous distributions.

There are three properties of cumulative probability distributions (c.q. ogives):

Furthermore there is the very important notion of expectation that we 'll define now as

(I.III-12)

or

(I.III-13)

according to Jeffreys' definition. Both expressions (I.III-12) and (I.III-13) are formulated for discrete variables.

Alternate definitions for continuous variables are

(I.III-14)

or

(I.III-15)

according to Jeffreys.

Some very interesting properties of expectations have to be considered in order to provide us a sound mathematical basis for several proofs and theorems to be discussed in the following chapters:

  1. E(a X + b) = a E(X) + b where E(b) = b

  2. E(g(X) + h(X)) = E(g(X)) + E(h(X))

  3. E(X + Y) = E(X) + E(Y)

  4. E(X Y) = E(X) E(Y)

if, and only if

(a and b are real numbers).

The mathematical expectation can be thought of as the most probable value of a variable.

The variance of a random variable is another very important property of probability distributions. It is in fact a measure for the spread of a stochastic variable. The easiest way to define a variance is by means of mathematical expectations as

(I.III-16)

or

(I.III-17)

Equation (I.III-17) can be proved using (I.III-16) and the four properties of expectations as follows

Of course variances have properties similar to the properties of expectations

(I.III-18)

and

(I.III-19)

It can be shown that centered moments can quite easily be computed from uncentered moments

(I.III-20)

(see also eq. (I.III-17)).

A probability distribution function can be characterized by its centered and uncentered moments (e.g., mean, variance, ...). Therefore a general method of deriving uncentered moments would be a very nice thing to have (the centered moments would then be computed from the conversion formulae (I.III-20)). This is not wishful thinking since the moment generating function of a discrete stochastic variable

(I.III-21)

(I.III-22)

Eq. (I.III-22) can be proved as

(I.III-23)

The covariance between two random variables can be defined easily using mathematical expectations

(I.III-24)

or equivalently

(I.III-25)

whereas the correlation is derived from the covariance as

(I.III-26)

The correlation between X and Y, as defined in, (I.III-26) lies between -1 and 1.

Intuitively the covariance and the correlation can be thought of as a measure of collinearity of the points in a scatter plot (a plot of variable Y against the corresponding values of X). The only difference between covariance and correlation is that the latter has been standardized and is therefore independent of both variables' dimension.

Above this, it is important to understand the following relationship

(I.III-27)

which clearly states that the implication is valid in one direction only, since the covariance and the correlation are by definition measures for linear relationships while the dependence of two random variables could by of any kind (linear and nonlinear)!

So far nothing has been said about the probability distributions of the random variables. Theoretically of course there are an infinite number of possible probability density functions but only few of them are worthwhile discussing shortly because of their immense importance in econometrics and statistics.

The binomial distribution can be defined as

(I.III-28)

where

(I.III-29)

and

p = chance of a success
q = chance of a failure
p + q = 1
n = number of independent draws
X = number of successes.

Furthermore E(X) = n p and V(X) = n p q. This can be proved by using the moment generating function (I.III-21) of (I.III-28) and applying (I.III-22) to it.

The binomial distribution is of huge importance in quality control and experimental design.

The most important probability distribution in statistics though is the normal distribution. This function is defined as follows

(I.III-30)

with

(I.III-31)

The mean and variance can be derived from the moment generating function. Above that, any normal distribution function is perfectly identified by its mean and variance.

The fact that a random variable is normally distributed can be denoted as

(I.III-32)

Having introduced this kind of notation for normal distributions, it is quite easy to describe the socalled additivity property.

(I.III-33)

then, it follows that

(I.III-34)

In other words, a linear function of normally distributed random variables is also normally distributed!

Proof for the generalized additivity property for independent variables is straightforward

(I.III-35)

(for independent Xt variables) since on using the moment generating function we obtain

(I.III-36)

(I.III-37)

(I.III-38)

(I.III-39)

(I.III-40)

according to the explicit expression of the moment generating function of normal stochastic variables, yielding finally

(I.III-41)

which proves the property (Q.E.D.).

Analogously a generalized additivity property for dependent variables can be obtained (see eq. (I.III-33) and (I.III-34)). This is however not necessary in our further discussions and thus beyond the scope of this work.

Derived from the normal distribution is the log-normal distribution. A random variable is log-normally distributed if, and only if

(I.III-42)

and if (by definition of ln)

(I.III-43)

The expectation and variance of this probability density function is given by

(I.III-44)

As previously stated a probability distribution function of a random variable can be formulated using probability theory, e.g., in (I.III-9) and (I.III-11). It is however imperative to formalize another link between probability and distribution theory: the Bienaymé-Chebyshev theorem (or inequality).

(I.III-45)

Since we defined in (I.III-45)

(I.III-46)

it is obvious that, on combining (I.III-45) and (I.III-46), the following holds

(I.III-47)

which is the Bienaymé-Chebyshev inequality.

vertical whitespace




Home
Up
Probability
Axiom System
Bayes Theorem
Matrix Algebra
Distribution Theory
Estimator Properties
Random Variables
horizontal divider
NEWS FEED from BBC News : Statistical Research
'Huge contrasts' in devolved NHSSeparate health policies across the UK are leading to wide differences for patients, it is claimed.
The school uniform price warWith children preparing to return to school, a price war has broken out over school uniforms. aimed at parents who are feeling the credit squeeze.
Britain's happiest places mappedThe most sparsely populated county in Wales is Britain's happiest place - but Edinburgh the least happy, say researchers.
Shetland obesity study dismissedHealth chiefs on Shetland dismiss research which claims the islands are the fattest area of the UK as "misleading and simplistic".
Study reveals teen self-harm rateA study suggests that up to 14% of 15 and 16-year-olds across central Scotland have self-harmed.
'Complexity' of Neanderthal toolsEarly stone tools developed by modern humans were no more advanced than ones used by the Neanderthals, research shows.
Councils urged to retain migrantsThe government should plan better so communities are boosted by the economic benefits generated by migrant workers, a think tank says.
In quotes: UK economic growth stallsUK economic growth ground to a halt between April and June, official figures show, heightening fears of a recession.
McCain cheers gloomy RepublicansThe BBC's Richard Lister examines looks at John McCain' strong performance in the opinion polls, despite the media's focus on his rival.
Drop in East European migrationThe number of Eastern European migrants coming to work in the UK has fallen to its lowest level since 2004, statistics show.
Public finances improve slightlyBritain's public finances get a boost from strong North Sea oil tax revenue, but are still weaker than a year ago.
Putting percentages in contextPercentages might make news stories stand out, but without a connection to the human experience, can become meaningless. In his third lesson of a weekly series, author Michael Blastland gives some hints for percentages.
Rise in people accessing HIV careThe number of people in the north west of England accessing HIV treatment is continuing to rise, new research says.
Traditional names are 'dying out'Traditional British names such as Norman and Gertrude Edna are in danger of dying out, a survey suggests.
Feeling the heat of food securityWhy reforming the economics of food production and supply would be beneficial for a number of environmental and social ills.
Bosses say education has worsenedSenior managers say education has declined over the last 10 years, despite record exam results.
Teens' test results to be issuedThe secondary school Sats test results for England are to be published although many are missing.
GB 'to meet' Olympic medal targetTeam GB should exceed a target of 41 medals at the Olympics demanded by sports minister Gerry Sutcliffe, research shows.
Call for better chemo counsellingDoctors are too worried to give patients with terminal cancer precise information about the benefits of extra chemotherapy, say researchers.
Upgrading road 'cut' fatalitiesResearch by surgeons claims the upgrading of a road cut the number of fatal accidents to zero within a year.
horizontal divider

© 2000-2006 - Office for Research, Development, and Education (called ORDE) - All rights reserved. This website is published by ORDE and owned by Resa R&D. This includes: html content, graphical illustrations (gif, jpg, and png files), computer software, online or electronic documentation, associated media, and printed materials. All Photographs (jpg files) are the property of Corel Corporation, Microsoft and their licensors. ORDE has acquired a non-transferable license to use these pictures in this website.
The free use of the scientific content in this website is granted for non commercial use only. In any case, the source (url) should always be clearly displayed. Under no circumstances are you allowed to reproduce, copy or redistribute the design, layout, or any content of this website (for commercial use) including any materials contained herein without the express written permission of ORDE.

Information provided on this web site is provided "AS IS" without warranty of any kind, either express or implied, including, without limitation, warranties of merchantability, fitness for a particular purpose, and noninfringement. ORDE uses reasonable efforts to include accurate and timely information and periodically updates the information without notice. However, ORDE makes no warranties or representations as to the accuracy or completeness of such information, and it assumes no liability or responsibility for errors or omissions in the content of this web site. Your use of this web site is AT YOUR OWN RISK. Under no circumstances and under no legal theory shall ORDE be liable to you or any other person for any direct, indirect, special, incidental, exemplary, or consequential damages arising from your access to, or use of, this web site.

Contributions and Scientific Research: Prof. Dr. E. Borghers, Prof. Dr. P. Wessa
Please, cite this website when used in publications: Xycoon (or Authors), Statistics - Econometrics - Forecasting (Title), Office for Research Development and Education (Publisher), http://www.xycoon.com/ (URL), (access or printout date).
Facilities, development, and design: Office for Research, Development, and Education

Comments, Feedback, Bugs, Errors | Privacy Policy Web Awards

This website is kindly sponsored by: Bandwidth Control | Time Series, Statistics Resources, and Statistics Software | Computer schools and technology degrees