Course:FRE521D

FRE 521D: Data Analytics in Climate, food and Environment
FRE 521D
Section:
Instructor:	TBD
Email:	TBD
Office:	TBA
Office Hours:	TBA
Class Schedule:	Feb 24 to April 11 Tue/Thu from 10:00- 11:30 AM
Classroom:	MCML 154
Important Course Pages
	Syllabus
	Lecture Notes
	Assignments
	Course Discussion

COURSE DESCRIPTION

In this course we will explore data and try to find patterns. We will look at the field of data science, using R, SQL, and Python to move through a data pipeline, from database, through some data wrangling, data visualization and inference.

LEARNING OUTCOMES

1. Machine Learning Fundamental

Terminology and Building Models
Machine learning approaches: regression, classification, clustering, decision trees and random forest.
Best and worst practices of managing machine learning projects.
Machine Learning pipeline development and value
Application: Climate Example: Assessment of ESG compliance by firms

2. SQL

Intro ( significance of SQL in data analysis and the modern job market, Understanding databases: Why are they essential?)
Databases

Different types of databases: Relational vs. Non-relational
Introduction to RDBMS (Relational Database Management System) structure

Intro to SQL
Basic SQL Queries ( ‘Select’, ‘Distinct’, ‘Where’, ‘Order by’, ‘Limit’
Functions and aggregation

Basic functions: ‘count()’, ‘sum()’, ‘AVG()’, ‘MIN()’ ,‘Max()’
Grouping data using ‘GROUP BY’
Filtering on aggregated data using ‘HAVING’

Joins and relationship.
Best and worst practices of managing machine learning projects.
Application: Climate Example: https://wbwaterdata.org/dataset/waterbase-water-quality.

3. Data Access and Querying on Cloud Based Platforms: API methods.

Data formats of CSV, JSON, or XML and how to accesss
Data wrangling, data cleaning, and preliminary visualization with Python
Advantages and disadvantages of the major data formats
Determining the best match between data formats and select analysis.

REAL-WORLD APPLICATIONS IN CLIMATE, FOOD & ENVIRONMENT

Machine Learning is the tool for analyzing big data and increasingly food and environment application have a big data component. This course will apply the data tools to areas such as ESG assessment, climate impact, and business ratings.

Career Relevance: Highlight how the course aligns with prospective career paths, emphasizing real-world applicability.

Businesses such as banks (lending assessment), agribusinesses (precision ag) are increasingly reliant on SQL and machine learning for assessments and predictive analytics.

Major project provides the opportunity to practice and data skills to a real-world data set.

How can we transition to making data-driven decisions?
How to use data and numbers to tell a story which will influence decision makers?

ASSESSMENT METHODS


Activity	% of Grade
Assignments / Labs	20%
Project	30%
Quizz	10%
Participation	10%
Midterm	30%
Total	100%