The goal here is to observe the general central limit theorem in computer simulation. |
| ||
X
be a uniform random variable and let Y
be the random variable given by Y
, first use simulation to generate one DataDesk variable XR
containing 200
Realizations of X
.
We will refer to this DataDesk variable as YR
which contains 200 Realizations of the random variable Y
. By default, DataDesk will name it something like YR
.
X
, XR
using the menu entry Manip → Generate Random Numbers... Specify 1
variable with 200
cases, and select the uniform distribution.
Y
, YR
from XR
( Manip → Transform → Sqrt(y) ).
YR
‡
YR
, does Y
appear to have a uniform distribution? Does Y
appear to have a Normal Distribution? Why or why not? Do the heights of the histogram bars generally increase or decrease as the value of Y
increases between 0
and 1
? Explain intuitively in terms of the graph of Y
and standard deviation of Y
?
Y
are exactly n | exact mean | mean(YBAR) (From Simulation) | exact SD | SD(YBAR) (From Simulation) | exact variance | variance(YBAR) (From Simulation) |
1 | ![]() | ![]() | ![]() | |||
5 | ||||||
10 | ![]() | |||||
40 |
Y
above with
5
what are the exact values of the mean and standard deviation of For n =5
, generate 200 realizations of. Computer Hint Note this means that you first have to generate
5
variables (each has 200 cases) of realizationsusing the method of Problem I. After you have generated these
5
variables, average them by the following steps: First use the menu entry Data → New → Derived Variable to write a formula. Assign the nameYBAR
to this variable. If your5
variables with the same distribution asY
are named, then you'd like to enter
as the defining formula of the derived variable
YBAR
. A way to do this with less typing is to select the5
iconsand drag them into the window where
YBAR
is being defined. This will give a list of these variable separated by commas. Now edit the commas, changing them to plus signs. This technique will really help on question III below when we ask you to do this for40
variables! An alternative is suggested at the end of the assignment.
YBAR
, estimate the mean ‡ and the variance ‡ of YBAR
. How do they compare to the exact values in (1)?
YBAR
. Does the distribution look normal?
10
and n = 40
and fill in the rest of the table above.
( You need to redo all three parts in II for n = 10
and n = 40
).
For n = 40
, what does the Central Limit Theorem say that the approximate distribution of YBAR is?
What are the mean and standard deviation of this approximating distribution?
Note in selecting 40
icons, you can just use your mouse, starting at the leftmost, and while holding the mouse button down, drag a rectangle touching all 40
icons to be included in the definition of YBAR.
You will have to change 39 commas to plus signs to come up with this informative average. You can do them one at a time in DataDesk, but a better way is to copy and paste into a text editor with search and replace. Do the search and replace and then copy and paste back into DataDesk. Note: When you copy/paste into your text editor, you may find that the √ symbol becomes something else. Don't worry about it. It should return to its original form when pasted back into DataDesk after you have done the replace/copy operation.
200
variables of 40
cases each.
All 200 variables are selected at this point.
Use Manip → Transform →