11.1 Exercise 5: barplot

  1. Import DataViz_source_files-main/files/population_age_group_per_sex_long_format.csv into an object called pop_age.
correction
pop_age <- read_csv("DataViz_source_files-main/files/population_age_group_per_sex_long_format.csv")


  1. Create a barplot that shows the number of people per Year:
correction
ggplot(data=pop_age, mapping=aes(x=Year, y=Population)) + 
  geom_bar(stat="identity")


  1. Split by Sex (in the same plot: do not split into different plots):
correction
ggplot(data=pop_age, mapping=aes(x=Year, y=Population, fill=Sex)) + 
  geom_bar(stat="identity")


  1. Filter the data to keep only “Spain” in “Country”:
correction
pop_age_spain <- filter(pop_age, Country=="Spain")
ggplot(data=pop_age_spain, mapping=aes(x=Year, y=Population, fill=Sex)) + 
  geom_bar(stat="identity")


  1. Show the bars next to each other.
correction
ggplot(data=pop_age_spain, mapping=aes(x=Year, y=Population, fill=Sex)) + 
  geom_bar(stat="identity", position="dodge")


  1. Change default colors using scale_fill_manual() layer.

The structure is not easy to remember: take a peek at the answer!

correction
ggplot(data=pop_age_spain, mapping=aes(x=Year, y=Population, fill=Sex)) + 
  geom_bar(stat="identity", position="dodge") + 
  scale_fill_manual(values=c(Female="darkgreen", Male="red"))


  1. Add a title, change the theme / background:
correction
ggplot(data=pop_age_spain, mapping=aes(x=Year, y=Population, fill=Sex)) +
  geom_bar(stat="identity", position="dodge") + 
  scale_fill_manual(values=c("darkgreen", "red")) + 
  ggtitle("Spanish population from 1950 to 2021 per sex") + 
  theme_minimal()


8. Now split the plot by Age group using facet_wrap()

correction
ggplot(data=pop_age_spain, mapping=aes(x=Year, y=Population, fill=Sex)) +
  geom_bar(stat="identity", position="dodge") + 
  scale_fill_manual(values=c("darkgreen", "red")) + 
  ggtitle("Spanish population from 1950 to 2021 per sex") + 
  theme_minimal() +
  facet_wrap(~Age_group)


  1. Save to a PDF file: Export -> Save as PDF. Note: when saving to PDF format, you cannot manualy drag the plot to get the size and proportion you want, but you can Preview before saving it.

If you prefer, you can save the file using ggsave() (or you can do both!).

correction
# first, save to an object
plot_age <- ggplot(data=pop_age_spain, mapping=aes(x=Year, y=Population, fill=Sex)) +
  geom_bar(stat="identity", position="dodge") + 
  scale_fill_manual(values=c("darkgreen", "red")) + 
  ggtitle("Spanish population from 1950 to 2021 per sex") + 
  theme_minimal() +
  facet_wrap(~Age_group)

# save file
ggsave(filename="population_age_sex_Spain.pdf", plot=plot_age, device = "pdf", units = "in", width = 10, height = 8)


  1. Save your filtered data as a .csv file: have a look at the write_csv() function from {readr}:
correction
write_csv(pop_age_spain, "subset_spain.csv")