The 2MASS Extended Source Catalog (XSC) is comprised of near-infrared sources that are clearly resolved and extended beyond the beam/PSF. Sources range in angular size from 10´´ to 2°: from small, faint galaxies to the largest Local Group galaxies. By their very nature, galaxies are complex and, therefore, require characterization measurements which are much more detailed than that for point sources. As such, the XSC includes many different kinds of measurements and various flavors of, e.g., position, orientation, size, symmetry, surface brightness and integrated flux. The user should take care to understand these parameters and their limitations. Here we answer some "Frequently Asked Questions" and give some brief guidelines on how to use the XSC to maximum effect.
Fundamental: The XSC is an extended source catalog. Although galaxies dominate the total numbers (and we frequently refer to the XSC as a galaxy catalog), the XSC is also comprised of Milky Way entities, such as globular and open clusters, planetary nebulae, HII regions, ISM emission and nebulosity, young stellar objects and compact star-formation regions, and even Solar System comets. These Galactic sources tend to be confined to the plane of the Milky Way. Refer to the "All-Sky" maps in II.3c2 to see how these sources are distributed across the sky.
Fast links to: IRSA astrometry photometry large galaxies Milky Way objects completeness/reliability outliers references/links |
IRSA includes many services to retrieve data and to compare across data sets. The two most important services relevant to the XSC are:
The absolute astrometric accuracy of 2MASS is better than 100 milli-arcsec for point sources. For extended sources, you can expect 0.5" accuracy for the peak-pixel coordinates and 0.3" accuracy for the centroid coordinates. For a comparison of the 2MASS peak-pixel coordinates with the FIRST radio survey, see II.3d5.
For most applications, the elliptical isophotal aperture is a good choice, both in terms of capturing most of the integrated flux (~80-90%) and providing accurate colors for galaxies of all sizes. Choose the "total" apertures (e.g., Kron or the Extrapolation SB Profile) if you need integrated fluxes that reflect the total flux of the source. But beware, these apertures are vulnerable to stellar contamination and surface brightness irregularities. The photometry based on the extrapolation of the surface brightness profile seems to be the more robust of the two methods (primarily because stellar contamination is minimized by averaging over the azimuthal isophote(s) used to derived the median surface brightness profile.)
The most robust aperture in the XSC is the circular, 7" radial aperture. This is a good choice if you are focused on faint or small compact sources. But beware, for larger galaxies the small apertures are not a good choice -- they are subject to a "bulge" color bias. Conversely, for small galaxies the circularizing effect of the PSF alters the shape (orientation) and radial sizes -- beware.
Below are links to explanations of the aperture photometry available in the XSC:
That said, if the user so desires to re-capture the "default" mags that were used in the incremental releases, then target the circular (K-band fiducial) isophotal mags, which are available in the XSC.
The 2MASS survey acquired images of the sky using relatively small arrays. A "tile" or scan is 8.5' in angular width, and the typical overlap between scans is 50". Galaxies that are smaller than this overlap are guaranteed to be fully sampled in at least one survey scan. Larger galaxies may be truncated based on their proximity to a scan edge. Therefore it was necessary to construct an atlas of large objects made from "pieces" of adjoining scans. The net outcome is that we will fully recover galaxies that are currently "lost" or misrepresented in the 2MASS extended source pipeline.
The resultant set of mosaics and corresponding source characterizations are collectively called the 2MASS Large Galaxy Atlas (LGA). This LGA information has been incorporated into the XSC for the largest 550 galaxies in the sky. LGA galaxies are easily identified n the XSC with the cc_flg parameter; they are tagged with: cc_flg = "Z". We have also identified sources that are in close proximity to large galaxies, whose photometry has been eliminated from the catalog due to their unreliable nature: cc_flg = "z".
The 2MASS Large Galaxy Atlas is described in detail in Jarrett et al. (2003, AJ, 125, 525).
The Atlas also contains a handful of objects that are not very large, including low surface brightness galaxies, Seyferts and AGN. In addition to reconstruction of large galaxies, the spirit of the LGA is to recover galaxies that are either lost from the 2MASS automated pipeline because of (1) confusion, or (2) because of surface brightness. Confused objects (including pairs, groups and cluster cores) require special processing to deblend the individual component flux. Seyfert/AGN might require special processing to account for the extreme color difference between 2MASS bands. Finally, even though the galaxy may be large at optical wavelengths, in the near-infrared the object might be quite small or invisible (e.g., extreme late-type galaxies). An example of a very small galaxy reconstructed in the Large Galaxy Atlas is IRAS 07598+6508 (Figure 1).
![]() |
Figure 1 |
We combine and exploit this multi-dimensional parameter space using a "decision tree" method. We build the decision tree training sets to do one thing only: separate the extended (resolved) sources from the point-like sources. It does nothing else. This method cannot (and will not) tell you (1) how extended an object is, or said another way, it will not tell you the probability of extendedness or otherwise, and (2) it will not tell you the nature of the sources tested against the decision tree. It can only tell you if the source is extended or if it is not.
That said, we do attempt something a bit more useful to our needs. The decision tree method is applied to the 2MASS parametric data for each band, independently. Although the band-to-band information might be correlated to some degree, by combining the decision tree results for each band we create a "pseudo-probability". We can "assign" a pseudo-probability by using a weighted average of the decision tree classifications for each band. This probability has a value between 1 and 2, with 1 being "extended" and 2 being "point-like". But owing to the correlated nature of source characterization between 2MASS bands, one should not read too much into small differences (e.g., there is virtually no difference between a value of 1.0 and 1.2). Two different probability flavors are generated: (1) g_score and the (2) e_score. These are employed as the final arbiter for star-galaxy separation. It turns out that a value of around 1.4 gives a satisfactory separation, while maintaining satisfactory completeness. Summary: extended objects have scores between 1 and 1.4, while point-like objects have values between 1.4 and 2.0.
e_score
The "e_score" is a simpler version of the g_score: it has only three parameters that
are tested in the decision trees, including surface brightness, shape and the
double/triple star
discriminator. It does not include a color attribute. In this sense, it is less biased
than the g_score. It was conceived to capture Milky Way objects that might have unusually
blue colors, or shapes distinctly different from galaxies. The XSC is drawn from a database sample that
has an e_score or g_score value less than 1.4.
Note that the Level-1 Requirements are only relevant to galaxies and the g_score (which is the most reliable discriminator anyway). See VI.5b.ii.
To summarize:
The 2MASS XSC includes sources that belong to the Milky Way: H II regions, stellar clusters, planetary nebulae, young stellar objects, emission-line nebulae, reflection nebulae and solar system comets. Nearly all Milky Way objects are tightly confined to the Plane of the Galaxy: |glat| < 5°. The exception to this rule are the giant molecular clouds that float above the Plane (e.g., Orion, Taurus, Rho Ophiuchus), and nebulosity in the LMC and SMC. You can see the Milky Way objects quite easily in the XSC Allsky image, lining the Plane in color "red". These objects are both intrinsically red (e.g., HII regions are dominated by emission bands in the 2 micron window) and dust-reddened. Although it is not possible uniquely to identify Milky Way objects in the XSC, most exhibit the following characteristics:
We have identified some 8,000 sources in the XSC that are probably Milky Way in origin, a smaller subset of 3712 sources are highly probably Galactic in nature. The smaller table of Milky Way sources is given in Table 1.
What is missing from the XSC? In the near-infrared, particular kinds of galaxies are nearly invisible, including late-types (Sd, Sdm, Im), blue compacts and LSBs; see the 2MASS near-infrared galaxy morphology sequence. The reason is that the near-infrared window is sensitive to giant stars and the older stellar populations that comprise the "backbone" of galaxies. It is not sensitive to massive star formation regions and hot blue stars, which might dominate the emitted light for late-type disk galaxies. A comparison between the optical and near-infrared windows is given in 2MASS Large Galaxy Atlas. In particular, see the discussion in sections 6.4, 6.5, 6.6 and 6.7. The end result is that catalog comparisons between the optical and NIR will show significant differences in completeness, surface brightness and color.
Galaxies are also missed due to confusion from nearby stars. The automated pipeline might peak up on the star, thus reducing the "extendedness" of the source, thereby excluding it from the XSC. Galaxies might also be missed due to bright star masking, where the confusion radius, diffraction spike or horizontal strip masking either eliminates the source completely or disrupts it to the point that it is quality-rejected from the XSC. Stellar confusion can also render the position of the "star+galaxy" to be inaccurate (the position may in fact correspond to the star itself, or the centroid of the blended object).
Another thing to consider is the position of your source. Occasionally the published position of a galaxy is quite inaccurate (tens of arcsec). The source may yet be in the XSC, but the user needs to search a larger radius to properly catch it.
Your object may belong to the 2MASS Large Galaxy Atlas, where oddball galaxies and invisible galaxies are recovered (in addition to large galaxies).
If you decide that your missing galaxy should have been detected by 2MASS (considering the arguments given above), then send the coordinates of the object to the 2MASS group (re: T. Jarrett), and we'll attempt recovery of your object.
In the Plane where source confusion reigns, triple stars are a major source of unreliability, particularly for faint XSC sources. The Level-1 Science Requirements do not apply to the Galactic Plane (|glat| < 10°); however, the reliability appears to be at least 80-90%, depending on the source confusion (as measured with the density metric.) The XSC stellar contamination is summarized in VI.5b.ii.
Other potential problems include (1) corruption from nearby bright stars (e.g., Beta Pegasi and extended source artifacts) and (2) airglow gradients in the background. Both of these phenomena can result in corrupted (or contaminated or modified) fluxes in one or more bands.
![]() |
![]() |
Figure 2 | Figure 3 |
2MASS fully covered the sky, providing a uniform census of galaxies in the local Universe. The XSC is only limited by (1) masking of bright stars, and (2) confusion noise in the Plane. Bright star masking is fully tracked and tabulated for the XSC on scales set by the 2MASS Atlas images (8.5 X 17 arcmin). The bottom line is that the spatial coverage is typically better than 98% for most of the sky. See the coverage maps here: Extended Source Spatial Coverage. A set of tan-projection FITS images are available for download at this link: II.6.f.
A consequence of squeezing the allsky into equal-area aitoff projections is that the map edges take on distortions that may look unphysical. For example, notice the streak-like structure in the lower right-hand portion of the equatorial map, corresponding to a coordinate: RA=20h 04m, Dec = -55d 56m. Here is a zoom (Figure 4) of this map region. Re-projecting the area using a standard "flat" tangent projection reveals the following (Figure 5): the field includes several known (catalogued) galaxy clusters, many of which are aligned along a diagonal to the equatorial plane. Note the prominent galaxy clusters at 19h 50m, -55d (Abell 3651) and 20h 12m, -56.8d (Abell 3667), both located at a redshift corresponding to 0.05 to 0.06.
![]() |
![]() |
Figure 4 | Figure 5 |
[Last Updated: 2015 Oct 21; by T. Jarrett and R. Cutri]