AITC Wiki

DTS002TC Essentials of Big Data

DTS002TC Essentials of Big Data

中文版:DTS002TC 大数据基础

This course provides a comprehensive introduction to big data essentials, covering Python programming fundamentals, numerical computing with NumPy, data manipulation with Pandas, data visualization with Matplotlib, and introductory machine learning using Scikit-Learn.

Course Overview

DTS002TC is an introductory course on big data essentials at Xi’an Jiaotong-Liverpool University. The course covers both theoretical foundations of big data and practical programming skills using Python and its data science ecosystem.

Lectures

#TopicMaterials
1General Introduction to Big DataLecture 1
2Technical Aspects of Big DataLecture 2
2+Intro to GPUGPU Intro
3aData Detectives - CIKWCIKW
3bStorage and Treatment of Big DataLecture 3
4Analysis of Big DataLecture 4
5Computer Vision and Big Data AnalysisLecture 5
5+Coursework 1 (SJL)CW1
Day2Introduction to PythonPython Intro
Day2Matplotlib and Machine LearningMatplotlib & ML

Labs

Lab 1: Python Fundamentals

Lab 2: Python Core Concepts

Lab 3: NumPy Basics

Lab 4: NumPy Advanced Arrays

Lab 5: NumPy Computation

Lab 6: Boolean Arrays & Pandas

Lab 7: Data Visualization

Lab 8: Machine Learning with Scikit-Learn

Lab 9-10: Final Practice

Review

Sources

All materials sourced from raw course files in raw/DTS002/.

此文件夹下有2条笔记。