Automatic 4-bit quantization and gradient checkpointing
Seamless export to GGUF, Ollama, and Hugging Face formats
2x faster training with 80% lower VRAM usage
Support for Llama 3.3, Mistral, Phi 3.5, and Gemma 2
Hardware-specific performance optimization profiles
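To make the 4-bit quantization feature above concrete, here is a toy, dependency-free sketch of symmetric absmax quantization to 16 levels (the general idea behind 4-bit weight compression). The function names and the specific rounding scheme are illustrative assumptions, not the library's actual implementation, which packs quantized values into optimized GPU kernels.

```python
def quantize_4bit(weights):
    # Symmetric absmax scheme (toy): map floats onto signed 4-bit
    # integers in [-7, 7] using a single per-tensor scale factor.
    scale = max(abs(w) for w in weights) / 7
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    # Recover approximate float weights from 4-bit codes.
    return [v * scale for v in q]

weights = [0.12, -0.5, 0.33, 0.9, -0.07]
q, scale = quantize_4bit(weights)
restored = dequantize(q, scale)
```

Storing `q` (4 bits per value) plus one float scale instead of full-precision floats is what drives the memory savings; the rounding error per weight is bounded by half the scale.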