local-ai-models

Comprehensive guide for implementing on-device AI models on iOS using Foundation Models and MLX Swift frameworks. Use WHEN building iOS apps with (1) Local LLM inference, (2) Vision Language Models (VLMs), (3) Text embeddings, (4) Image generation, (5) Tool/function calling, (6) Multi-turn conversations, (7) Custom model integration, or (8) Structured generation.

Stars: 18

Forks: 6

Updated: January 6, 2026 at 21:30

Source: mintuz/claude-plugins

iOS On-Device AI Models

Production-ready guide for implementing on-device AI models in iOS apps using Apple's Foundation Models framework and MLX Swift.

When to Use This Skill

Implementing local LLM inference in iOS apps
Building chat interfaces with Foundation Models
Integrating Vision Language Models (VLMs)
Adding text embeddings or image generation
Implementing tool/function calling with LLMs
Managing multi-turn conversations
Optimizing memory usage for on-device models
Supporting internationalization in AI features

Core Principles

Availability First - Always check model availability before initialization
Stream Responses - Provide progressive UI updates for better UX
Session Persistence - Reuse LanguageModelSession for multi-turn conversations (Foundation Models)
Memory Awareness - Use quantized models and monitor memory usage
Async Everything - Load models asynchronously, never block the main thread
Locale Support - Use supportsLocale(_:) and locale instructions for Foundation Models
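
The first, second, and sixth principles can be sketched together for Foundation Models. This is a minimal sketch, not the guide's own implementation: `LocalModelError` and `streamReply` are hypothetical names, and it assumes an SDK in which each streamed element is a snapshot exposing `content` (the element shape differed in early betas). The session is passed in rather than created so that callers can reuse one `LanguageModelSession` across turns.

```swift
import Foundation

/// Hypothetical error type for this sketch; not part of any framework.
enum LocalModelError: Error, Equatable {
    case modelUnavailable
    case unsupportedLocale
}

#if canImport(FoundationModels)
import FoundationModels

/// Streams a reply from an existing session, reporting partial text as it arrives.
@available(iOS 26.0, macOS 26.0, *)
func streamReply(
    to prompt: String,
    in session: LanguageModelSession,
    onUpdate: @escaping @MainActor (String) -> Void
) async throws {
    let model = SystemLanguageModel.default

    // Availability First: stop early if the system model cannot be used.
    guard case .available = model.availability else {
        throw LocalModelError.modelUnavailable
    }
    // Locale Support: check the user's locale before generating.
    guard model.supportsLocale(Locale.current) else {
        throw LocalModelError.unsupportedLocale
    }
    // Stream Responses: forward each partial result to the UI on the main actor.
    for try await snapshot in session.streamResponse(to: prompt) {
        await onUpdate(snapshot.content)
    }
}
#endif
```

A caller would typically create the session once, for example `LanguageModelSession(instructions: "...")`, keep it in a long-lived object, and call `streamReply` for every user turn.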

Quick Reference

Framework Comparison

Topic	Guide
Framework comparison and selection	framework-selection.md

Foundation Models (Apple's Framework)

Topic	Guide
Setup and configuration	foundation-models/setup.md
Chat patterns and conversations	foundation-models/chat-patterns.md

MLX Swift (Advanced Features)

Topic	Guide
Setup and configuration	mlx-swift/setup.md
Chat patterns with custom models	mlx-swift/chat-patterns.md
Vision Language Models (VLMs)	mlx-swift/vision-patterns.md
Tool calling, embeddings, structured generation	mlx-swift/advanced-patterns.md
Model quantization with MLX-LM	mlx-swift/quantization.md

Shared (Both Frameworks)

Topic	Guide
Best practices and optimization	shared/best-practices.md
Error handling and recovery	shared/error-handling.md
Testing strategies	shared/testing.md

Quick Decision Trees

Which framework should I use?

Do you need advanced features like:
- Vision Language Models (VLMs)
- Image generation
- Custom models beyond the system model
├── Yes → MLX Swift (references/mlx-swift/)
└── No → Is this a standard chat interface?
    ├── Yes → Foundation Models (simpler, recommended)
    └── No → Check framework-selection.md for guidance
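
The tree above can also be written as a small function, which is handy for unit-testing a feature-gating decision. All type and function names here are hypothetical; only the branching mirrors the tree.

```swift
/// Possible outcomes of the decision tree.
enum FrameworkChoice {
    case mlxSwift
    case foundationModels
    case seeSelectionGuide  // fall back to framework-selection.md
}

/// Hypothetical description of what a feature needs.
struct FeatureRequirements {
    var needsVisionLanguageModel = false
    var needsImageGeneration = false
    var needsCustomModel = false
    var isStandardChatInterface = false
}

/// Mirrors the tree: any advanced need selects MLX Swift,
/// otherwise a standard chat interface selects Foundation Models.
func recommendFramework(for requirements: FeatureRequirements) -> FrameworkChoice {
    if requirements.needsVisionLanguageModel
        || requirements.needsImageGeneration
        || requirements.needsCustomModel {
        return .mlxSwift
    }
    return requirements.isStandardChatInterface ? .foundationModels : .seeSelectionGuide
}
```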

Where should I start?

New to on-device AI?
└── Start with Foundation Models:
    1. Read framework-selection.md
    2. Follow foundation-models/setup.md
    3. Implement foundation-models/chat-patterns.md

Need advanced features?
└── Use MLX Swift:
    1. Read framework-selection.md
    2. Follow mlx-swift/setup.md
    3. Choose pattern:
       - Chat: mlx-swift/chat-patterns.md
       - Vision: mlx-swift/vision-patterns.md
       - Advanced: mlx-swift/advanced-patterns.md

Where should my model loading code live?

Is this model shared across features?
├── Yes → Create @Observable service in app/services/
└── No → Is it feature-specific?
    ├── Yes → Create @Observable class in feature/
    └── No → Load inline with @State (simple cases only)
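
For the shared-service branch, a minimal sketch of an `@Observable` service might look like the following. It assumes a deployment target where the Observation framework is available (iOS 17 or later). `LoadedModel`, `ModelService`, and the injected `load` closure are hypothetical stand-ins for whatever the chosen framework actually returns. Loading is asynchronous, and concurrent callers do not trigger a second load.

```swift
import Observation

/// Hypothetical stand-in for the value a real model loader returns.
struct LoadedModel: Sendable {
    let name: String
}

/// Shared service that loads a model once and publishes its state to SwiftUI.
@MainActor
@Observable
final class ModelService {
    private(set) var model: LoadedModel?
    private(set) var isLoading = false
    private(set) var errorMessage: String?

    // The loader is injected so tests can supply a fake.
    private let load: @Sendable () async throws -> LoadedModel

    init(load: @escaping @Sendable () async throws -> LoadedModel) {
        self.load = load
    }

    /// Loads the model unless it is already loaded or a load is in flight.
    func loadIfNeeded() async {
        guard model == nil, !isLoading else { return }
        isLoading = true
        defer { isLoading = false }
        do {
            model = try await load()
            errorMessage = nil
        } catch {
            errorMessage = String(describing: error)
        }
    }
}
```

A view can hold the service in `@State` or read it from `@Environment` and call `loadIfNeeded()` from a `.task` modifier, which keeps the work off the synchronous view-update path.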

How should I handle conversations?

Foundation Models:
└── Reuse LanguageModelSession for context
    (references/foundation-models/chat-patterns.md#multi-turn)

MLX Swift:
└── Implement custom context management
    (references/mlx-swift/chat-patterns.md)
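
For the MLX Swift branch, "custom context management" usually means keeping the message history yourself and trimming it to fit the model's context window. The sketch below is one possible approach and uses no MLX API. `ChatMessage` and `ConversationContext` are hypothetical types, and the character count is a crude stand-in for a real token count from the model's tokenizer.

```swift
/// Hypothetical chat message type for this sketch.
struct ChatMessage: Equatable {
    enum Role: String { case system, user, assistant }
    let role: Role
    let content: String
}

/// Keeps a rolling history: the system prompt stays, oldest turns are dropped first.
struct ConversationContext {
    private(set) var messages: [ChatMessage]
    let maxCharacters: Int  // assumption: characters approximate the token budget

    init(systemPrompt: String, maxCharacters: Int) {
        self.messages = [ChatMessage(role: .system, content: systemPrompt)]
        self.maxCharacters = maxCharacters
    }

    /// Total size of the history under the character-count approximation.
    var totalCharacters: Int {
        messages.reduce(0) { $0 + $1.content.count }
    }

    /// Appends a message, then trims the history back under the budget.
    mutating func append(_ message: ChatMessage) {
        messages.append(message)
        // Index 0 is the system prompt; always keep it and the newest message.
        while totalCharacters > maxCharacters, messages.count > 2 {
            messages.remove(at: 1)
        }
    }
}
```

Before each generation call, the `messages` array would be converted into whatever prompt or chat format the loaded model expects.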

What generation parameters should I use?

What's the use case?

Factual answers (summaries, facts)
└── temperature: 0.1-0.3

Balanced (chat, Q&A)
└── temperature: 0.6-0.8

Creative (storytelling, ideas)
└── temperature: 0.9-1.2

See references/shared/best-practices.md for details
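
The ranges above can be captured as presets so call sites name a use case instead of a magic number. This is a sketch under stated assumptions: `GenerationUseCase` is a hypothetical type, and choosing the midpoint of each range as the default is an illustrative choice, not a recommendation from the guide. The guarded extension assumes the Foundation Models `GenerationOptions(temperature:)` initializer.

```swift
/// Hypothetical presets mirroring the temperature ranges listed above.
enum GenerationUseCase {
    case factual, balanced, creative

    /// Suggested temperature range for the use case.
    var temperatureRange: ClosedRange<Double> {
        switch self {
        case .factual: return 0.1...0.3
        case .balanced: return 0.6...0.8
        case .creative: return 0.9...1.2
        }
    }

    /// Midpoint of the range, used here as an illustrative default.
    var defaultTemperature: Double {
        (temperatureRange.lowerBound + temperatureRange.upperBound) / 2
    }
}

#if canImport(FoundationModels)
import FoundationModels

@available(iOS 26.0, macOS 26.0, *)
extension GenerationUseCase {
    /// Options that can be passed to a session's respond or streamResponse call.
    var generationOptions: GenerationOptions {
        GenerationOptions(temperature: defaultTemperature)
    }
}
#endif
```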
