Course:FRE521D

From UBC Wiki


FRE 521D: Data Analytics in Climate, food and Environment
Wiki.png
FRE 521D
Section:
Instructor: TBD
Email: TBD
Office: TBA
Office Hours: TBA
Class Schedule: Feb 24 to April 11

Tue/Thu from 10:00- 11:30 AM

Classroom: MCML 154
Important Course Pages
Syllabus
Lecture Notes
Assignments
Course Discussion


COURSE DESCRIPTION

In this course we will explore data and try to find patterns.  We will look at the field of data science, using R, SQL, and Python to move through a data pipeline, from database, through some data wrangling, data visualization and inference.  

LEARNING OUTCOMES

1. Machine Learning Fundamental

  • Terminology and Building Models
  • Machine learning approaches: regression, classification, clustering, decision trees and random forest.
  • Best and worst practices of managing machine learning projects.
  • Machine Learning pipeline development and value
  • Application: Climate Example: Assessment of ESG compliance by firms

2. SQL

  • Intro ( significance of SQL in data analysis and the modern job market, Understanding databases: Why are they essential?)  
  • Databases  
  • Different types of databases: Relational vs. Non-relational
  • Introduction to RDBMS (Relational Database Management System) structure
  • Intro to SQL  
  • Basic SQL Queries ( ‘Select’, ‘Distinct’, ‘Where’, ‘Order by’, ‘Limit’
  • Functions and aggregation  
  • Basic functions: ‘count()’, ‘sum()’, ‘AVG()’, ‘MIN()’ ,‘Max()’
  • Grouping data using ‘GROUP BY’
  • Filtering on aggregated data using ‘HAVING’
  • Joins and relationship.
  • Best and worst practices of managing machine learning projects.
  • Application: Climate Example: https://wbwaterdata.org/dataset/waterbase-water-quality.

3. Data Access and Querying on Cloud Based Platforms: API methods.

  • Data formats of CSV, JSON, or XML and how to accesss
  • Data wrangling, data cleaning, and preliminary visualization with Python
  • Advantages and disadvantages of the major data formats
  • Determining the best match between data formats and select analysis.

REAL-WORLD APPLICATIONS IN CLIMATE, FOOD & ENVIRONMENT

Machine Learning is the tool for analyzing big data and increasingly food and environment application have a big data component. This course will apply the data tools to areas such as ESG assessment, climate impact, and business ratings.

Career Relevance: Highlight how the course aligns with prospective career paths, emphasizing real-world applicability.

Businesses such as banks (lending assessment), agribusinesses (precision ag) are increasingly reliant on SQL and machine learning for assessments and predictive analytics.

Major project provides the opportunity to practice and data skills to a real-world data set.

  • How can we transition to making data-driven decisions?
  • How to use data and numbers to tell a story which will influence decision makers?

ASSESSMENT METHODS

Activity % of Grade
Assignments / Labs 20%
Project 30%
Quizz 10%
Participation 10%
Midterm 30%
Total 100%