Modeling in R
Sudha PurohitAim of Course:
In this course you will learn how to use R to build statistical models and use them to analyze data. Multiple regression is covered first followed by logistic regression. The generalized linear model is then introduced and shown to include multiple regression and logistic regression as special cases. The Poisson model for count data will be introduced and the concept of overdispersion described. You will then learn how to analyse longitudinal data, first using relatively straightforward graphics and simple inferential approaches. This will be followed by describing mixed-effects models and the generalized estimating approach for such data. The emphasis in the course is how to use R to fit the models listed and how to interpret the R output, rather than the theoretical background of the models. Consequently some knowledge of linear models is required (statistics.com has courses in all of them).Who Should Take This Course:
Anyone who is familiar with R and wants to learn how to use it to build and use statistical models. Important: the course will cover a variety of techniques, and it is expected that most participants will be interested in learning how to use R for some, but not all of them. If you expect to cover all the techniques as you take the course, you should budget a bit more time than the 10-15 hours suggested.For those enrolled in Professional Advancement Programs, this is a required or elective course in the following Programs:
- Biostatistics (controlled trials) - elective
- Biostatistics (epidemiology) - elective
Course Program:
The course is structured as followsSESSION 1: Linear Regression, Logistic Regression
- Multiple linear regression with R
- Simple examples, dummy explanatory variables, interpreting regression coefficients. Finding a parsimonious model. Regression diagnostics. Detailed analysis of some cloud seeding data.
- Logistic regression with R.
- The need for a different model when the response variable is binary. The logistic transform and fitting the model to some simple examples. Deviance residuals.
- Multiple regression and logistic regression as special cases of the generalized linear model.
- The Poisson model for count data.
- Colonic polyps example. The problem of overdispersion.
- Examples of longitudinal data.
- Simple graphics for longitudinal data and simple inference using the summary measure approach.
- The 'long form' of longitudinal data.
- Models for longitudinal data when independence of the repeated measurements is assumed.
- Modeling the correlational structure of the repeated measurements.
- The generalized estimating equation approach for non-normal response variables in longitudinal data.
- The dropout problem.
The Instructor:
Dr. Sudha Purohit is a Visiting Lecturer in Statistics at the University of Pune and, before her retirement in 2000, was Head of the Department of Statistics at A. G. College, Pune, India. She is a co-author of four books, Life-Time Data: Statistical Models and Methods, (with Prof. J. V. Deshpande), Introduction to Biometry , Microarray Data: Statistical Analysis Using R (with Prof. Shailaja Deshmukh) and Statistics Using R (jointly with Dr. Sharad Gore, Prof. Shailaja Deshmukh). Her areas of interest are survival analysis, reliability, programming with R and analysis of microarray data. She has published a number of research papers in various peer-reviewed journals.Organization of the Course:
The course takes place over the internet, at statistics.com. During each course week, you participate at times of your own choosing - there are no set times when you must be online. Course participants will be given access to a private discussion board. In class discussions led by the instructor, you can post questions, seek clarification, and interact with your fellow students and the instructor. The course is scheduled to take place over 4 weeks, and typically requires 10-15 hours per week. At the beginning of each week, you receive the relevant material, in addition to answers to exercises from the previous session. During the week, you are expected to go over the course materials and work through exercises. Discussion among participants is encouraged. The instructor will provide answers and comments.Certificates and Grades:
You may be interested only in learning the material presented, and not be concerned with grades or certificates. Or you may be enrolled in a statistics.com Professional Advancement Program that requires demonstration of proficiency in the subject, in which case your work will be assessed for purposes of issuing a grade. Or you may require only a "Certificate of Course Completion," along with professional development credit in the form of Continuing Education Units (CEU's). As you begin the class, you will be asked to specify your category.Credit:
This course offers continuing education units (CEU's). For those successfully completing the course (generally this means marks of 50% or better on the homework), 5.0 CEU's and a certificate will be issued by statistics.com, upon request.Dates:
May. 2 - May. 30, 2008Aug. 29 - Sep. 26, 2008
Click here to be notified of future course offerings.
Participants gain access to the online materials on the first day of the course, and typically spend about 10-15 hours per week (at their convenience). You retain full access to course materials, including discussion board, for two weeks after the course closing date.
Level:
Advanced/IntermediatePrerequisite:
You should be familiar with (1) the basic use of R - see Introduction to R - Data Handling and Introduction to R - Statistical Analysis, and (2) statistical modeling techniques. Statistics.com has a variety of courses in the modeling techniques themselves that will also serve as pre-requisites for that aspect of the course.Course Text:
The course text is "A Handbook of Statical Analysis Using R" by Everitt and Hothorn, from Chapman & Hall/CRC. PLEASE ORDER YOUR COPY IN TIME FOR THE COURSE STARTING DATE. CRC Press usually offers a 25% discount when this form is used to place your order.Software:
Students must have access to R. For information on obtaining a copy of R, please Click Here.Registration:
Register Online - $449Register Online (academic) - $349 (you must be affiliated with a college, university or high school)
Add $50 service fee if you require a prior invoice, or if you need to submit a purchase order or voucher, pay by wire transfer or EFT, or refund and reprocess a prior payment. Please use this printed registration form, for these and other special orders.
Note: Courses may fill up at any time and registrations are processed in the order in which they are received. Your registration will be confirmed for the first available course date, unless you specify otherwise.
