STAT 4870/7870 - Fall 2019Final ProjectDue Thursday, December 5, 2019 (by 5:00 pm by email)If you have any questions please come to office hours or email me or set up an appointment!Note: Projects will be worked on individually or in groups of two. Note that projects worked on in groupsof two require prior approval of the group formed and should be obtained as soon as possible (by 10/8/19).Option 1:STAT 4870:Choose an interesting time series that can be modeled using a seasonal ARIMA(p, d, q) × (P, D, Q)s model(see Section 3.9 tsaEZ). Determine two good models for your data using the methods learned throughoutthe semester. Obtain forecast intervals for 2s time periods into the future. Write a report based on youranalysis.STAT 7870:A real time series, should be used for the analysis. You must choose a dataset where the analysis involvesa method of analysis that we have not formally discussed in the class. Please see me to discuss this whenyou have some ideas.Guidelines for all students:1. Briefly describe the process from which your realization came and give the source of your data.Explain the purpose of your analysis. Include a table of the data.2. Decide on two final models, both of which fit the data well. Compare forecasts for these best twomodels, and briefly comment on any differences.3. Your reports should be concise but complete. The final written report should be typed and includea one-page non-technical summary of the findings, followed by the background of the scientificquestions, the body of technical analyses with interpretations, a conclusion and the listing of thedata. Including graphics, the report should not exceed 8 pages in length (unless prior approval isgiven). Reports should not include any raw computer output. Use cutting and pasting to presentonly relevant computer output and graphs. Output is relevant if you refer to it in your write up.4. Your project will be graded on accuracy, completeness, relevance of your graphical displays, tabularcomparisons, and your statements, and on the report’s organization, conciseness, and neatness.Whenever possible use graphs plus a few words to make your points. Number your figures and referto them in your write up. Reports must be typed or done using a word processing system.5. On a separate (but attached) page provide a one paragraph description (for each student) detailingthe contributions made to the final project.How to choose data for the final project:Most importantly, the process that generates data should be interesting to YOU. You should understandthe basics of the data you work with (For example, do not choose outcomes of rugby tournaments if STAT 4870/7870代写、R程序设计调试、代写R、代you1do not know the rules of the game and do not want to learn them as a part of your project.) The datashould be long enough to ensure that you can perform meaningful statistical analysis. I suggest you stick totime series with at least 180 observations (the more the better). DO NOT choose a dataset just becauseyou found how other people analyzed it and want to use this as a guide. This will seriously hurt your grade.A one-page proposal outlining the scientific questions to be addressed and the relevant techniquesto be employed, with a separate listing of the data, has to be turned in by email onFriday October 25 (by 5 pm).Option 2 (STAT 7870 Only): Use R to carry out a simulation studyPropose a simulation study to be investigated in R. Note that this option is suggested for graduate studentsonly, or those with good programming skills and a good background in statistical inference. Feel free torun your proposed idea by me before beginning. I can also email you with ideas if you are having a hardtime coming up with one. Some example problems include:1. Consider the ARIMA (p, 0, 0) model, that is, a pure AR model. Design a simulation with studies theeffect of the error distribution on the behavior of the MLE estimator(s) φi. You should have factorssample size n, true parameter value(s), and error distribution.2. Consider the ARIMA (p, 0, 0) model, that is, a pure AR model. The R function for fitting an autoregressivemodel an output vector asy.var.coef of asymptotic variance of the MLE parameterestimates. Design a simulation with compares the Monte Carlo estimate of variance to these asymptoticvariances, as function of n (The two should get closer as n increases.)3. Design a simulation with studies the performance of the estimated autocorrelation function. (Recallthat the true autocorrelation function for ARMA model can be obtained using the R commandARMAacf.) There are many situations here, and you will have to choose a particular one. Certainlysample size will be a factor, but you should also study the effect of other factors.A one-page proposal outlining the specific simulation study to be addressed and the relevanttechniques to be employed has to be turned in by email on Friday October 25 (by 5 pm).Some possible DATA Sources: (You can use others that are not listed here.)Time Series Data Library:https://datamarket.com/data/list/?q=provider:tsdlStatLib Datasets Library at Carnegie Mellon:http://lib.stat.cmu.edu/datasets/NBER Data - National Bureau of Economic Researchhttp://www.nber.org/data/Federal Reserve Economic Datahttp://research.stlouisfed.org/fred2/DATA.GOVhttps://www.data.gov/2转自:http://www.3daixie.com/contents/11/3444.html
讲解:STAT 4870/7870、R、R、ARIMA Statistics、、|Web
©著作权归作者所有,转载或内容合作请联系作者
- 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
- 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
- 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...