Core Applications (Week 2-3)¶

Overview¶

Weeks 2-3 focus on building practical AI applications that solve real business problems. You'll learn advanced fine-tuning techniques, agent frameworks, and workflow automation to create production-ready systems.

Week 2: Customer Support Chatbot¶

Fine-tuning Fundamentals¶

Learn the difference between pre-training, fine-tuning, and prompt engineering. Understanding when and how to fine-tune effectively separates hobbyists from professional AI engineers.

The Training Hierarchy¶

Pre-training (Foundation Learning)¶

# Pre-training: Learning language from massive text corpora
# This happens once and costs millions of dollars

corpus = [
 "The internet contains billions of web pages...",
 "Machine learning is a subset of artificial intelligence...",
 "Customer service representatives help customers...",
 # ... billions more sentences
]

# Model learns:
# - Grammar and syntax
# - General knowledge 
# - Basic reasoning
# - Language patterns

# Result: A model that understands language but isn't specialized

Fine-tuning (Specialization)¶

# Fine-tuning: Adapting to specific tasks with smaller datasets
# This is what we do as AI engineers

task_specific_data = [
 {"input": "I need help with my order", 
 "output": "I'd be happy to help you with your order. Can you provide your order number?"},
 {"input": "My product is defective", 
 "output": "I'm sorry to hear about the defective product. Let's get this resolved for you..."},
 # ... thousands more examples
]

# Model learns:
# - Task-specific patterns
# - Domain vocabulary
# - Appropriate tone and style
# - Specific behaviors

Parameter-Efficient Fine-tuning (PEFT)¶

Modern fine-tuning techniques that update only a small subset of model parameters, offering significant advantages:

from peft import LoraConfig, get_peft_model

# Load base model
base_model = AutoModelForCausalLM.from_pretrained("microsoft/DialoGPT-medium")

# Configure LoRA (Low-Rank Adaptation)
lora_config = LoraConfig(
 r=16, # Rank of adaptation
 lora_alpha=32, # LoRA scaling parameter
 target_modules=["c_attn", "c_proj"], # Which layers to adapt
 lora_dropout=0.1, # Dropout for LoRA layers
)

# Apply LoRA to model
model = get_peft_model(base_model, lora_config)

# Output: trainable params: 294,912 || all params: 355,118,080 || trainable%: 0.08%

Benefits of PEFT¶

100x fewer trainable parameters - Dramatically reduces compute requirements
Faster training and less memory - Can train on smaller GPUs
Multiple adapters - One base model, many specialized tasks
Prevents catastrophic forgetting - Preserves original model capabilities

Customer Support Chatbot Project¶

Build a production-ready customer support chatbot with fine-tuned conversation AI, context handling, and role-based responses.

Key Features¶

Fine-tuned responses - Trained on specific FAQ dataset
Conversation context - Maintains history across turns
Escalation handling - Knows when to transfer to humans
Sentiment awareness - Adjusts tone based on customer emotion
Multi-turn conversations - Handles complex dialog flows

Data Preparation Workflow¶

def prepare_conversation_data(data_path):
 """
 Prepare conversational data for fine-tuning
 Expected format: JSON with customer/agent turns
 """

 with open(data_path, 'r') as f:
 conversations = json.load(f)

 # Convert to training format
 training_data = []

 for conversation in conversations:
 # Build conversation history
 context = ""
 for turn in conversation['turns']:
 if turn['speaker'] == 'customer':
 context += f"Customer: {turn['text']}\n"
 else: # agent
 # Create training example
 input_text = context + "Agent:"
 output_text = turn['text']

 training_data.append({
 'input': input_text,
 'output': output_text,
 'conversation_id': conversation['id']
 })

 context += f"Agent: {turn['text']}\n"

 return training_data

Evaluation Metrics¶

Perplexity - How well the model predicts responses
Response Quality - Human evaluation of helpfulness
Conversation Success Rate - Percentage of issues resolved
Escalation Rate - When chatbot transfers to humans

Week 3: Ask-the-Web Agent¶

Agent Frameworks and Tool Orchestration¶

Build AI agents that can access external tools and APIs to gather information, similar to Perplexity AI or research assistants.

Agent Architecture Patterns¶

Planner-Worker Pattern¶

class ResearchAgent:
 def __init__(self, llm, tools):
 self.llm = llm
 self.tools = tools
 self.memory = []

 def research(self, query):
 # 1. Plan the research approach
 plan = self.create_research_plan(query)

 # 2. Execute each step
 for step in plan:
 result = self.execute_step(step)
 self.memory.append(result)

 # 3. Synthesize findings
 report = self.synthesize_report(query, self.memory)
 return report

 def create_research_plan(self, query):
 prompt = f"""
 Break down this research query into specific steps:
 Query: {query}

 Available tools: web_search, summarize, fact_check

 Create a step-by-step plan:
 """

 response = self.llm.generate(prompt)
 return self.parse_plan(response)

Tool Integration with LangChain¶

from langchain.agents import Tool, AgentExecutor, LLMSingleActionAgent
from langchain.tools import DuckDuckGoSearchRun

# Define tools
search_tool = Tool(
 name="web_search",
 description="Search the web for current information",
 func=DuckDuckGoSearchRun().run
)

summarize_tool = Tool(
 name="summarize",
 description="Summarize long text content",
 func=summarization_pipeline
)

fact_check_tool = Tool(
 name="fact_check",
 description="Verify factual claims",
 func=fact_verification_function
)

tools = [search_tool, summarize_tool, fact_check_tool]

Web Search and Citation¶

class WebSearchAgent:
 def search_with_citations(self, query):
 # Search multiple sources
 search_results = self.multi_source_search(query)

 # Extract and rank information
 ranked_info = self.rank_by_relevance(search_results, query)

 # Generate response with citations
 response = self.generate_cited_response(ranked_info, query)

 return response

 def generate_cited_response(self, sources, query):
 prompt = f"""
 Answer the query using the provided sources. Include citations [1], [2], etc.

 Query: {query}

 Sources:
 {self.format_sources(sources)}

 Provide a comprehensive answer with proper citations.
 """

 return self.llm.generate(prompt)

n8n Workflow Automation¶

Integrate your AI agent with n8n to create automated workflows that connect to external services and databases.

Automated Research Pipeline¶

Trigger: Email with research request
Agent: Performs web research and analysis
Storage: Saves results to Google Sheets
Notification: Sends completion email with report

Workflow Configuration¶

{
 "nodes": [
 {
 "name": "Email Trigger",
 "type": "Gmail Trigger",
 "parameters": {
 "filters": {
 "subject": "Research Request:"
 }
 }
 },
 {
 "name": "AI Research Agent",
 "type": "HTTP Request",
 "parameters": {
 "url": "https://your-vm.ionos.com/research",
 "method": "POST",
 "body": {
 "query": "{{ $json.body }}"
 }
 }
 },
 {
 "name": "Save to Sheets",
 "type": "Google Sheets",
 "parameters": {
 "operation": "append",
 "sheetId": "your-sheet-id",
 "values": [
 "{{ new Date().toISOString() }}",
 "{{ $json.query }}",
 "{{ $json.report }}"
 ]
 }
 }
 ]
}

Ask-the-Web Agent Project¶

Build a Perplexity-style research agent with automated report generation and workflow integration.

Project Features¶

Multi-source search - DuckDuckGo, Tavily API integration
Intelligent summarization - Extract key information from web content
Citation tracking - Maintain source attribution
Automated workflows - n8n integration for reporting
Follow-up questions - Handle iterative research

Example Interaction¶

User: "What are the latest trends in renewable energy?"

Agent:
1. Searching for recent renewable energy developments...
2. Analyzing market reports and news articles...
3. Extracting key trends and statistics...
4. Generating comprehensive report with citations...

Report: Based on recent research, here are the key trends in renewable energy:

**Solar Power Growth**: Solar installations increased 25% in 2024, driven by 
decreasing costs and improved efficiency [1][2].

**Battery Storage Expansion**: Energy storage capacity doubled, making renewable 
energy more reliable [3][4].

**Green Hydrogen Development**: Major investments in hydrogen production for 
industrial applications [5][6].

Sources:
[1] International Energy Agency - Solar Report 2024
[2] Bloomberg New Energy Finance - Market Outlook
[3] Energy Storage Association - Annual Survey
[4] McKinsey Energy Insights - Battery Trends
[5] Hydrogen Council - Industry Report
[6] Clean Energy Wire - Market Analysis

Follow-up: Would you like me to dive deeper into any specific trend?

Key Learning Outcomes¶

After completing weeks 2-3, you will:

Master fine-tuning - Understand full fine-tuning vs PEFT techniques
Build conversational AI - Create context-aware chatbots for business use
Architect agent systems - Design multi-tool AI agents with proper orchestration
Integrate workflows - Connect AI to external services using n8n automation
Handle citations - Build trustworthy systems that provide source attribution
Deploy production systems - Scale applications for real-world usage

Technical Skills Developed¶

Parameter-Efficient Fine-tuning - LoRA, adapters, and PEFT techniques
LangChain Framework - Agent orchestration and tool integration
Web Search APIs - DuckDuckGo, Tavily, and content extraction
Workflow Automation - n8n configuration and API integration
Conversation Design - Context handling and multi-turn dialog
Information Synthesis - Combining multiple sources with proper attribution

Next Steps¶

With core applications mastered, you're ready for:

Advanced Techniques (Week 4-5) - Deep reasoning and multimodal AI
Capstone & Advanced (Week 6-7) - Independent projects and advanced topics

Resources¶

Hugging Face Fine-tuning Guide (opens new window) – Official fine-tuning documentation
LoRA Paper (opens new window) – Original LoRA research paper
LangChain Documentation (opens new window) – Agent frameworks and tool integration
n8n Documentation (opens new window) – Workflow automation platform