Friday, December 20, 2019

Limitations of PurpleAir Observations

British statistician George Box once wrote "all models are wrong, some are useful."  When it comes to observations, I like to paraphrase that to "all observations are bad, some are useful."  This statement reflects the fact that all observations contain errors and uncertainty, but they can still be useful.

PurpleAir uses low-cost laser particle counters to estimate PM2.5 concentrations. The sensors can be purchased and operated by anyone, with data available on the PurpleAir map. Many groups and individuals have installed PurpleAir sensors across the Salt Lake Valley, northern Utah, and other parts of the nation (and even the world). In the Salt Lake Valley, there is a remarkably high density of stations. Below is a PurpleAir map of PM2.5 concentrations from 7:09 AM MST this morning. With such a high density of stations, one can see some of the spatial variability in pollution, including the relatively low values of PM2.5 on the east bench compared to the central and northwest valley.

It should be noted, however, that while useful for examining the spatial patterns of pollution, PurpleAir sensors can have low absolute accuracy. What this means is that while one can see that the east side has relatively clean air compared to the central and northwest valley in the map above, the actual values for PM2.5 concentrations may be off. Kelly et al. (2017) compared the performance of PurpleAir sensors with research-grade instruments, and while they found good correlation, they also found that the sensors overestimated particulate matter concentrations during cold air pools, or what many Utahns refer to as inversions. More recently, Tryner et al. (2020) also found that PurpleAir sensors overestimated PM2.5 concentrations in the field.
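
To make the distinction between good correlation and low absolute accuracy concrete, here is a minimal sketch in Python. The numbers are invented purely for illustration: the reference values, the 1.5 scaling, and the 2.0 offset are assumptions of mine and are not taken from Kelly et al. (2017) or any real sensor. The function names are likewise hypothetical.

```python
from statistics import mean


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def mean_bias(sensor, reference):
    """Average of (sensor - reference); positive means the sensor reads high."""
    return mean(s - r for s, r in zip(sensor, reference))


# Made-up reference PM2.5 values (ug/m3), NOT real observations.
reference = [5.0, 10.0, 20.0, 30.0, 40.0]

# Hypothetical low-cost sensor that tracks the reference perfectly
# but reads high. The 1.5 and 2.0 are illustrative assumptions only.
low_cost = [1.5 * r + 2.0 for r in reference]

print("correlation:", round(pearson_r(reference, low_cost), 3))
print("mean bias (ug/m3):", round(mean_bias(low_cost, reference), 1))
```

In this toy case the correlation is essentially perfect, so the sensor would capture where and when pollution is higher or lower, yet every reading is too high. That is the sense in which a network can be good for spatial patterns while its absolute values are off.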

Data from PurpleAir sensors is now being used on local news broadcasts and by the National Weather Service. However, the limitations of these observations need to be recognized. Last night, the National Weather Service tweeted that air quality was in the red across much of the Wasatch Front, Tooele Valley, and Cache Valley. In their tweet, they included maps with PurpleAir observations.

However, data from Utah Division of Air Quality sensors, as well as sensors operated by the University of Utah, showed PM2.5 concentrations to be much lower.  At Hawthorne Elementary, hourly PM2.5 concentrations peaked at 37 ug/m3, on the low end of unhealthy for sensitive groups.
Source: DAQ
Elsewhere last night, DAQ sensors in Davis County peaked at 30 ug/m3, Tooele County at 39 ug/m3 (although there was a spike to 54 at 11 AM), and Weber County at 23 ug/m3. These observations are consistent with air quality in the moderate or unhealthy for sensitive groups categories, depending on location. In Cache County, PM2.5 concentrations were highest, but DAQ sensors peaked at 47 ug/m3, still in the unhealthy for sensitive groups category.
Thus, DAQ sensors did not indicate PM2.5 concentrations as high as those indicated by PurpleAir, and the National Weather Service tweet that air quality was in the red was not consistent with DAQ observations.
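
For readers who want to check the categories themselves, here is a minimal sketch in Python that maps the DAQ peak values quoted above to EPA air quality categories. It uses the EPA PM2.5 breakpoints in effect at the time of this post. Those breakpoints are formally defined for 24-hour averages, so applying them to hourly peaks, as is commonly done on maps, is a simplification. The function and variable names are my own and are not part of any DAQ or PurpleAir tool.

```python
import math

# Upper bound of each category in tenths of a ug/m3 (integers avoid
# floating-point edge cases), with the category name and map color.
# These are the EPA PM2.5 breakpoints in use when this post was written.
PM25_CATEGORIES = [
    (120, "Good", "green"),
    (354, "Moderate", "yellow"),
    (554, "Unhealthy for Sensitive Groups", "orange"),
    (1504, "Unhealthy", "red"),
    (2504, "Very Unhealthy", "purple"),
    (math.inf, "Hazardous", "maroon"),
]


def pm25_category(concentration):
    """Return (category, color) for a PM2.5 concentration in ug/m3."""
    # Truncate to 0.1 ug/m3 before comparing with the breakpoints.
    tenths = math.floor(concentration * 10 + 1e-9)
    for upper, name, color in PM25_CATEGORIES:
        if tenths <= upper:
            return name, color


# Peak DAQ values quoted in this post (ug/m3).
daq_peaks = {
    "Hawthorne Elementary": 37,
    "Davis County": 30,
    "Tooele County": 39,
    "Tooele County (11 AM spike)": 54,
    "Weber County": 23,
    "Cache County": 47,
}

for site, value in daq_peaks.items():
    name, color = pm25_category(value)
    print(f"{site}: {value} ug/m3 -> {name} ({color})")
```

None of these values reaches the red (unhealthy) category, which under these breakpoints begins at 55.5 ug/m3.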

Finally, we can examine observations collected by the University of Utah on TRAX trains and at various sites in the valley. Below is a map for the period from 6:29 to 7:29 AM this morning. Again, values are lower than indicated by the PurpleAir sensors (compare with the first graphic in this post).

I think that the PurpleAir network is wonderful in the sense that it helps us to identify the spatial patterns of pollution, but it is important that the sensors' tendency to overestimate PM2.5 concentrations be recognized by those communicating with the public.


  1. It's worth pointing out that observations on the Purple Air map that are circled in black denote *indoor* sensors, which tend to be cleaner. And, there are a lot more indoor sensors in the north/east part of the valley than in other areas.

    If only outdoor sensors are displayed, the pollution distribution is less extreme. Though, I do agree that the elevated areas of the valley are cleaner than the lowlands.

    1. Thanks. I should have noted that, although my comment about the clean east bench was supported by the outdoor sensors near Foothill and I-215, where several have readings ≤10.

  2. There is a conversion factor called AQandU available in the legend on the PurpleAir map. It was created by Kerry Kelly specifically for PurpleAir sensor data in the Salt Lake Valley during the wintertime.

    1. Thanks for pointing that out. I assume you mean this site: If so, I don't think what is being used for television broadcasts and by many people is the conversion factor data.