The image above is a map of the stars constructed by the European Space Agency’s Gaia space telescope. It reportedly shows 1,000,000,000 stars.1
Gaia data are available in CSV form at this site. A codebook is here
Download one of the CSV files and see what you can make of it. For instance, …
Stars_042 <- readr::read_csv("Data/GaiaSource_000-000-042.csv.gz")
.csv
in the name. What does the .gz
mean at the end of the file name?.csv.gz
files available, estimate how many stars there are in the complete catalog.phot_g_mean_flux
as the intensity and ecl_lon
and ecl_lat
as the position variables.A simple plot:
Stars_042 %>%
sample_n(size = 10000) %>%
ggplot(aes(x = ecl_lon, y = ecl_lat)) +
geom_point(aes(size = phot_g_mean_flux,
color = phot_g_mean_flux,
alpha = phot_g_mean_flux))
Stars_042 <-
Stars_042 %>%
mutate(facet = round(log10(phot_g_mean_flux)),
color = log10(phot_g_mean_flux),
size = log10(phot_g_mean_flux)/20)
Stars_042 %>%
sample_n(size = 1000) %>%
ggplot(aes(x = ecl_lon, y = ecl_lat)) +
geom_point(size = 0.5, aes(color = color, alpha = size)) +
facet_wrap( ~ facet)
7. Is there a relationship between the ra
and dec
variables and the ecl_lon
and ecl_lat
variables? Try different ways assigning the variables to aesthetics until you find one that tells the story.
Stars_042 %>%
sample_n(size = 1000) %>%
ggplot(aes(x = ecl_lat, y = dec)) +
geom_point(size = 0, aes(color = ecl_lon))
ra
/dec
coordinate system and the ecl_lat
/ecl_lon
system. Suggestion: Pull out only those stars that fall within a narrow band of the edges of a square in one of the coordinate systems. Then make separate plots of those stars in the two systems, perhaps using color to encode which stars in one plot correspond to stars in the other plot.See this story on the BBC web site.↩