 I'd appreciate suggestions or advice on analyzing the following type of data. Somebody here has school-level data on the number of EMS calls made for kids during a semester. The element I need help with is that the school district decided it needed to preserve something and so recoded the data: schools with 1 through 5 calls in a semester were given a value of 3; otherwise the true value was recorded. So the distribution looks, for example, like 0, 3, 6, 8, 9, 12, etc. I have the enrollment at each school, so I can compute a rate, but because of the grouping it is not accurate. What I am asking for is direct advice from anybody who has analyzed such data, references to articles about how such data can be or has been analyzed, what this type of data would be called as a search term (I know that work has been done with completely grouped counts), or where to look for advice (other listservs, for instance). Thanks, Gene Magin
 Is this a school district where you can reach the person or persons who work with the data? Would they be willing to produce the rates you are interested in? What is the rationale for coarsening the data? If they released a new data set with the rate and the coarsened count, would it be possible to reverse engineer the original count? ----- Art Kendall Social Research Consultants
 In SPSS you can limit the imputed outcome to a particular range over the entire data set for multiple imputation, but that won't take into account the potential range implied by the already binned data (e.g., 3 can be from 1-5, 8 can be from 6-10, etc.). A Google Scholar search turns up some implementations that do take that into account (https://scholar.google.com/scholar?hl=en&as_sdt=0%2C44&q=interval+censored+multiple+imputation&btnG=). Sometimes this kind of data is called *interval censored* or *binned* data. How you approach it depends on the type of analysis you want to do. For simple descriptive purposes, you have bounds on the counts, so you can bound various summary statistics and simple tests of differences. If the count is the dependent variable, you might use some type of censored regression approach. If it is an independent variable, that is where I have seen imputation used. See https://prod.sandia.gov/techlib-noauth/access-control.cgi/2007/070939.pdf (it has no references to the imputation stuff, though). ----- Andy W apwheele@gmail.com http://andrewpwheeler.wordpress.com/
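 As a rough illustration of the bounding and censored-likelihood ideas in the reply above, here is a minimal Python sketch. It assumes the recoding described in the original question (a recorded 3 stands for any true count from 1 to 5; every other value is exact) and, as a further assumption not made in the thread, that counts are Poisson with a single common per-student rate and enrollment as the exposure. The function names and the toy data are hypothetical, and the crude grid search stands in for a proper optimizer.

```python
import math

def count_bounds(recorded):
    """(low, high) for the true count behind a recorded value.
    Assumption from the thread: a recorded 3 means 'somewhere in 1..5'."""
    return (1, 5) if recorded == 3 else (recorded, recorded)

def rate_bounds(recorded, enrollment, per=1000):
    """Lower and upper bound on calls per `per` students for one school."""
    low, high = count_bounds(recorded)
    return (per * low / enrollment, per * high / enrollment)

def pois_pmf(k, mu):
    """Poisson probability of exactly k events when the mean is mu > 0."""
    return math.exp(k * math.log(mu) - mu - math.lgamma(k + 1))

def loglik(rate, data):
    """Interval-censored Poisson log-likelihood.
    Each school contributes P(low <= count <= high) with mean rate * enrollment."""
    total = 0.0
    for recorded, enrollment in data:
        low, high = count_bounds(recorded)
        mu = rate * enrollment
        total += math.log(sum(pois_pmf(k, mu) for k in range(low, high + 1)))
    return total

def fit_rate(data, max_rate=0.05, n_grid=5000):
    """Crude grid search for the per-student rate that maximizes loglik."""
    grid = [max_rate * i / n_grid for i in range(1, n_grid + 1)]
    return max(grid, key=lambda r: loglik(r, data))

# Hypothetical toy data: (recorded calls, enrollment) for six schools.
schools = [(0, 400), (3, 500), (6, 800), (8, 650), (3, 300), (12, 1200)]

for recorded, enrollment in schools:
    print(recorded, enrollment, rate_bounds(recorded, enrollment))

print("estimated calls per 1000 students:", 1000 * fit_rate(schools))
```

 The first part only bounds each school's rate and needs no distributional assumption. The second part is one possible censored-count model; school-level covariates or overdispersion would call for a regression version of the same likelihood.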
 Do you mean that the school district has raw data by student? Is the data being provided as part of discovery? I know you may not be able to disclose some information, but is this an individual-based case or part of a class action? Was any rationale given for coarsening the data? ----- Art Kendall Social Research Consultants
