Column definition questions #12

keflavich · 2022-11-04T19:06:47Z

Another documentation question.

What are all the column names in the returned catalog?

crowdsource/crowdsource/crowdsource_base.py

Lines 954 to 960 in d0bb2eb

    
           return OrderedDict([('dx', posunc[0]), ('dy', posunc[1]), 
        
                               ('dflux', fluxunc), 
        
                               ('qf', qf), ('rchi2', rchi2), ('fracflux', fracflux), 
        
                               ('fluxlbs', fluxlbs), ('dfluxlbs', dfluxlbs), 
        
                               ('fwhm', fwhm), ('spread_model', spread), 
        
                               ('dspread_model', dspread), 
        
                               ('fluxiso', fluxiso), ('xiso', xiso), ('yiso', yiso)])

Some are pretty obvious:

dx, dy: Positional uncertainties
dflux: Flux uncertainty
rchi2: reduced chi^2, where the 'degrees of freedom' is set by qf
fwhm: 2nd-moment-based FWHM estimate

What are:

the qf quality factor? It seems to be the integral over the PSF for each star including only "good" pixels. Is there any rule-of-thumb for what is a 'good enough' number?
fracflux: I don't know what impsf is; is it the image convolved with the PSF? Still not sure what this statistic tells me.
fluxlbs, dfluxlbs: What does "LBS" stand for?
fluxiso, xiso, yiso: I can't quite tell what the compute_iso_fit function is doing in a quick read; it looks like it's performing a least-squares fit between the PSF and the data allowing only for amplitude and shift variation? But I might be misreading it
spread_model, dspread_model: this is kinda documented, but I'm not familiar with it. Sounds like it's useful for galaxy measurements?

I'm sure some of this is documented in the literature in your or others' papers, but even if so, it would be helpful to have a statement of what is intended by the code so I don't mis/overinterpret what it's doing.

The text was updated successfully, but these errors were encountered:

schlafly · 2022-11-04T19:28:09Z

Yes, there's some documentation in the papers and online here:
http://decaps.skymaps.info/catalogs.html
obviously in the context of decaps.

For what it's worth I don't see the FWHM as derived from the second moment, but instead of as derived from the effective number of pixels in the PSF, and then transformed to what you would get for a Gaussian for the equivalent effective number of pixels.

Yes, qf is what you describe. For unsaturated sources, I'd be deeply skeptical of anything with qf < 0.6 or so; the suggestion is that we're on the edge of a chip or a bad region and don't even have the peak on a good pixel. I'd put tighter bounds if I wanted very good photometry, more like 90-95%.

fracflux is intended to be a measure of how blended the source is. It's the PSF-weighted flux of the stamp after subtracting neighbors, divided by the PSF-weighted flux of the full image including neighbors. So if you have no neighbors around, it's 1. If typically half the flux in one of your pixels is from your neighbors, it's 0.5, where 'typically' is in a PSF-weighted sense.

The lbs quantities are "local background subtracted" quantities, where I've repeated the fit on the neighbor subtracted image with a new sky pedestal. They haven't proven valuable. Were they significantly different from the other fluxes, that would be a sign to worry.

The documentation on the decaps webpage for the iso quantities is:
"flux derived from linear least squares fit to neighbor-subtracted image; significant difference from ordinary flux indicates a convergence issue" which is a pretty good summary of how I feel about it. There's really not anything fundamentally different about it than about the normal fluxes, except that it comes out of something simple rather than a big linear least squares package.

I think of spread_model as a simple size estimator. decaps webpage documentation is "sextractor-like spread_model; positive means the source is broader than a PSF". Here's the sextractor documentation:
https://sextractor.readthedocs.io/en/latest/Model.html#model-based-star-galaxy-separation-spread-model

keflavich · 2022-11-04T19:30:50Z

Thanks, that's extremely helpful. The fracflux especially, I completely did not appreciate that it was measuring overlap.

keflavich · 2022-11-04T19:45:09Z

Well, the LBS is super helpful in some cases. This is an example I have:

where I believe the "flux" measurement has gone totally off the rails. This is likely a separate issue, but I've had this problem occur with subtle changes in the weights. The "flux" values are definitely wrong. The LBS flux looks less wrong.

For comparison, this is what it should look like:

schlafly · 2022-11-04T20:39:01Z

Yeah, I would want to know more there. Not knowing anything, it looks like the sky isn't being subtracted, leading everything to have a significant positive flux. Clearly the local background subtraction is helping, but I don't really think it should; something else is also going on.

keflavich · 2022-11-04T20:40:27Z

yes, that's exactly right - it took me a while to figure this out, but the sky model is very bad when the weights are subtly changed. Any idea what would cause that? I'm still digging to diagnose, but your instincts are certainly more finely tuned than mine.

keflavich · 2022-11-04T20:53:06Z

I'm using nskyx=nskyy=1, which iiuc, is a simple constant model for each star cutout? Maybe I should be using something higher-order, but if it's not working for 0 or 1st order, I'm wary of doing so.

schlafly · 2022-11-05T13:30:59Z

Try nskyx=nskyx=0. That's what we use in DECaPS and probably unWISE.

There are two contributions to the sky model: a linear one that is fit simultaneously with all of the stars, and a higher spatial order one that's just a median in different cells. We rely on the latter when nskyx=nskyx=0. nskyx=nskyx=1 means a pedestal is fit for the full image, not each stamp.

The former is nice in the blended limit in that it lets the sky get fainter and all of the stars get brighter simultaneously; i.e., it speeds convergence. But it couples all of the pixels together, so one bad pixel can ruin the analysis of a whole image. In your case, I'd guess you have some pixels with value = 0 or -infinity or just really implausibly low given the uncertainties and the true sky background. And the code is minimizing chi^2 by badly undersubtracting the sky to accommodate them.

So I'd guess you need to either mask them (ivar = 0) or get rid of the global sky fit so they don't ruin the whole image.

keflavich · 2022-11-05T14:30:42Z

Thanks, that's again very helpful and clear. I bet there are some zeros, or near-zeros (1e-N with N>5?), in my data that I did not mask out. So I'm going to try masking that out, but I'll also try different orders.

schlafly · 2022-11-05T16:59:43Z

The fitting gives you the model image, so the diagnostic I usually use is chi = (data-model)/sigma. I'd expect you'd find a small number of pixels have very large negative chi, yeah.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Column definition questions #12

Column definition questions #12

keflavich commented Nov 4, 2022

schlafly commented Nov 4, 2022

keflavich commented Nov 4, 2022

keflavich commented Nov 4, 2022

schlafly commented Nov 4, 2022

keflavich commented Nov 4, 2022

keflavich commented Nov 4, 2022

schlafly commented Nov 5, 2022

keflavich commented Nov 5, 2022

schlafly commented Nov 5, 2022

Column definition questions #12

Column definition questions #12

Comments

keflavich commented Nov 4, 2022

schlafly commented Nov 4, 2022

keflavich commented Nov 4, 2022

keflavich commented Nov 4, 2022

schlafly commented Nov 4, 2022

keflavich commented Nov 4, 2022

keflavich commented Nov 4, 2022

schlafly commented Nov 5, 2022

keflavich commented Nov 5, 2022

schlafly commented Nov 5, 2022