Overview
This section documents how to build a small, specialist LLM fine-tuned for a specific domain and deployed locally on edge hardware — with no cloud dependency at inference time. The canonical example throughout is a network-engineer expert model: an LLM that understands Juniper and Cisco device semantics, can translate English intent into device commands, troubleshoot from status output, and interpret standard network diagnostic tools.
The section covers:
- How LLMs are trained from scratch and what fine-tuning actually modifies
- Efficient fine-tuning methods (LoRA, QLoRA) that run on consumer hardware
- What training data looks like, how to source it, and how to format it correctly
- Compute options: Apple Silicon locally vs. rented GPU instances
- The specific data types needed to produce a credible network-domain expert
Subtopics
- Fundamentals — LLM training mechanics and fine-tuning methods
- Training Data — Data sourcing, formatting, and quality
- Compute Options — Apple Silicon, cloud GPUs, cost and time estimates
- Specialist Models — Network engineer expert models: Juniper and Cisco