granitepi

Granite-Pi

Author: Julian A. Gonzalez, IBM Champion 2025
Date: 6-29-2025

IBM Granite + Raspberry Pi

Overview: A beginner-friendly guide to running IBM’s Granite 3.2-2B language model on Raspberry Pi 5 using only built-in hardware. Create a local AI assistant with CPU-only processing.

πŸš€ Key Features

βš™οΈ Hardware Requirements

| Component | Minimum | Recommended | |———–|β€”β€”β€”|β€”β€”β€”β€”-| | Raspberry Pi | Pi 5 (4GB) | Pi 5 (8GB) | | Storage | 64GB Class 10 microSD | Higher speed card | | Power Supply | Standard USB-C | 27W Official PSU | | Cooling | Heatsink | Active cooling case | | Network | Wi-Fi | Ethernet connection |

πŸ“₯ Installation Steps

  1. Set up Raspberry Pi OS
    • Flash 64-bit Raspberry Pi OS using Imager
    • Enable SSH and configure Wi-Fi during setup
    • Update system: sudo apt update && sudo apt upgrade -y
  2. Install Ollama
    curl -fsSL https://ollama.com/install.sh | sh
    
  3. Optimize Performance
    • Set CPU governor to performance mode
    • Configure Ollama to use all 4 CPU cores
  4. Download Granite Model
    ollama pull granite3.2:2b
    # Alternative custom setup if needed
    ollama create granite-pi -f granite-pi.Modelfile
    

πŸ§ͺ Basic Testing

Start interactive session:

ollama run granite-pi

Sample prompts:

πŸ“Š Performance Expectations

| Pi Model | Response Time | Best Use Cases | |β€”β€”β€”-|β€”β€”β€”β€”β€”|β€”β€”β€”β€”β€”| | Pi 5 (4GB) | 10-20 seconds | Simple questions | | Pi 5 (8GB) | 5-15 seconds | Longer conversations |

πŸ› οΈ Troubleshooting

| Issue | Solution | |β€”β€”-|β€”β€”β€”-| | Slow responses | Check cooling, verify performance mode | | Out of memory | Close apps, reboot, reduce context size | | System freezes | Improve cooling, use 8GB model | | Model won’t load | Restart Ollama, check memory |

πŸ’‘ Usage Ideas

πŸ”’ Safety Tips

Note: The Pi will get warm during operation - ensure proper cooling!

Explore the possibilities of local AI on affordable hardware!

Full Guide Details Ollama GitHub IBM Granite Docs