AI012 Professional

Deep Dive into Large Language Models

This course provides a comprehensive and in-depth introduction to the development history of large language models (LLMs), their core technical architectures, training paradigms (pretraining, fine-tuning, and alignment), multimodal extensions, prompt engineering, chain-of-thought reasoning, agents, as well as frontier topics such as model safety and privacy protection.

4.9

24.0h

1067 students

0 likes

Artificial Intelligence

Start Learning

Lessons

Lesson

1 Lesson 1

This lesson explores the paradigm shift from task-specific AI to general-purpose Large Language Models, highlighting how scaling laws and the Transformer architecture enable emergent reasoning capabilities. Students will learn how the self-attention mechanism allows models to process data in parallel and why decoder-only architectures have become the standard for modern generative AI.

2 Lesson 2

This lesson explores the evolution of Transformer architectures, highlighting why the industry has shifted toward Decoder-only models due to their superior scaling laws and generative capabilities. It also examines the foundational training pipeline, covering the transition from pre-training and Chinchilla optimality to modern instruction tuning and alignment techniques.

3 Lesson 3

This lesson explores the evolution of prompting strategies, ranging from zero-shot instructions to few-shot demonstrations and Chain-of-Thought reasoning for complex logic. Students will learn how to improve model performance and reliability by using structural constraints, logical decomposition, and precise formatting to ensure outputs are suitable for programmatic use.

4 Lesson 4

This lesson explores the evolution of reasoning in large language models, moving from linear Chain-of-Thought to advanced structured architectures like Tree-of-Thought and Graph-of-Thought. Students will learn how these frameworks, alongside techniques like Program of Thought and knowledge editing, enable models to perform deliberate planning, verification, and complex mathematical problem-solving.

5 Lesson 5

This lesson explores the challenges of AI safety, focusing on the training paradox between model helpfulness and harmlessness, as well as the mechanics of adversarial jailbreak attacks. It also introduces statistical watermarking techniques, such as the KGW framework, which use vocabulary partitioning and logit bias to embed detectable, invisible signatures into AI-generated content.

6 Lesson 6

This lesson explores the fundamental conflict in AI alignment between maintaining model helpfulness and ensuring harmlessness, highlighting how jailbreak attacks exploit this tension through role-playing and cognitive coercion. Students will learn how these vulnerabilities function and examine technical strategies, such as intent analysis and robust system prompting, to defend against prompt-based bypass attempts.

7 Lesson 7

This lesson explores the evolution of Multi-modal Large Language Models (MLLMs) from vision-centric systems to integrated architectures capable of processing diverse sensory inputs like audio and 3D data. Students will learn how specialized encoders and projection bridges align non-textual signals into a unified semantic space, enabling the model to effectively reason across multiple modalities.

8 Lesson 8

This lesson explores the development of autonomous GUI agents that use a tripartite architecture—planning, decision-making, and reflection—to interact with software interfaces. It further examines how Reinforcement Learning and RLHF enable these agents to adapt to dynamic environments while maintaining safety and reliability against potential risks.

Course Overview

📚 Content Summary

This course provides a comprehensive and in-depth introduction to the evolution of Large Language Models (LLMs), core technical architectures, training paradigms (pre-training, fine-tuning, and alignment), multimodal extensions, prompt engineering, Chain of Thought (CoT), agents, as well as frontier topics such as model safety and privacy protection.

Deep analysis of the technological evolution and safety alignment of full-stack large models, from pre-training to general agents.

🎯 Learning Objectives

Distinguish between model architectures: Identify the structural differences and use cases for Encoder-only, Decoder-only, and Encoder-Decoder models.
Explain the LLM Training Pipeline: Describe the transition from self-supervised pre-training to Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF).
Analyze Model Scaling and Behavior: Explain the concepts of Scaling Laws, Emergent Abilities (In-context learning, Chain of Thought), and the phenomenon of Hallucinations.
Analyze the structural differences between Encoder-only (BERT), Decoder-only (GPT), and Encoder-Decoder (T5) architectures.
Explain the three-stage training process: Pre-training (Base model), Instruction Tuning (SFT), and Alignment (RLHF/PPO).
Compare the performance, scaling laws, and architectural innovations of mainstream LLMs including GPT, Llama, Qwen, and DeepSeek.
Implement zero-shot and few-shot prompting strategies for structured data extraction and classification.
Calibrate model hyperparameters (Temperature, Top P, Penalties) to balance creative and deterministic outputs.
Construct effective Chain-of-Thought (CoT) prompts using manual, automatic, and zero-shot ("Let's think step by step") methods.
Analyze and Compare CoT Variants: Differentiate between Self-Consistency, Program of Thought (PoT), Tree-of-Thought (ToT), and Graph-of-Thought (GoT) architectures.