aws-lambda-managed-instances

Evaluate, configure, and migrate workloads to AWS Lambda Managed Instances (LMI). Triggers on: Lambda Managed Instances, LMI, capacity provider, multi-concurrency Lambda, dedicated instance Lambda, EC2-backed Lambda, cold start elimination, Graviton Lambda, instance type for Lambda, Lambda cost optimization with Reserved Instances or Savings Plans. Also trigger when users describe high-volume predictable workloads seeking cost savings, or compare Lambda vs EC2 for steady-state traffic. For standard Lambda without LMI, use the aws-lambda skill instead.

Install command

npx skills add https://github.com/awslabs/agent-plugins --skill aws-lambda-managed-instances

Copy and paste this command into Claude Code to install the skill

Source

awslabs/agent-plugins

Stars: 771

Forks: 111

Updated: June 3, 2026 at 15:54

name	aws-lambda-managed-instances
description	Evaluate, configure, and migrate workloads to AWS Lambda Managed Instances (LMI). Triggers on: Lambda Managed Instances, LMI, capacity provider, multi-concurrency Lambda, dedicated instance Lambda, EC2-backed Lambda, cold start elimination, Graviton Lambda, instance type for Lambda, Lambda cost optimization with Reserved Instances or Savings Plans. Also trigger when users describe high-volume predictable workloads seeking cost savings, or compare Lambda vs EC2 for steady-state traffic. For standard Lambda without LMI, use the aws-lambda skill instead.
argument-hint	[describe your workload or what you need help with]
metadata	{"tags":"lambda, lmi, managed-instances, ec2, capacity-provider, multi-concurrency, cost-optimization"}

AWS Lambda Managed Instances (LMI)

Run Lambda functions on current-generation EC2 instances in your account while AWS manages provisioning, patching, scaling, routing, and load balancing. Combines Lambda's developer experience with EC2's pricing and hardware options.

For standard Lambda development, see the aws-lambda skill. For SAM/CDK deployment, see the aws-serverless-deployment skill.

When to Load Reference Files

Cost comparison, pricing analysis, Lambda vs LMI cost, Savings Plans, or Reserved Instances -> see references/cost-comparison.md
Instance types, memory sizing, vCPU ratios, scaling tuning, or capacity provider config -> see references/configuration-guide.md
Thread safety, concurrency model, code review checklist, Powertools compatibility, or multi-concurrency readiness -> see references/thread-safety.md
Before/after code examples, runtime-specific migration (Node.js, Python, Java, .NET), or connection pooling -> see references/migration-patterns.md
IAM roles, VPC setup, CLI commands, SAM template, or CDK example -> see references/infrastructure-setup.md and scripts/setup-lmi.sh
Errors, throttling, debugging, or stuck deployments -> see references/troubleshooting.md

Quick Decision: Is LMI Right for This Workload?

Signal	LMI is a strong fit	Standard Lambda is better
Traffic	Steady, predictable, 50M+ req/mo	Bursty, unpredictable, long idle
Cost	Duration-heavy spend at scale	Low or sporadic invocations
Cold starts	Unacceptable (LMI eliminates them for provisioned capacity; scale-out may still incur brief delays)	Tolerable or mitigated by SnapStart
Compute	Latest CPUs, specific families, high network bandwidth	Standard Lambda memory/CPU sufficient
Isolation	Dedicated EC2 instances in your account, full VPC control	Shared Firecracker micro-VMs acceptable
Scale-to-zero	Not needed (execution environments always running)	Required (pay nothing when idle)
Code readiness	Thread-safe (Node.js/Java/.NET) or any Python code	Non-thread-safe Node.js/Java/.NET code that is expensive to change

Instructions

Step 1: Assess the Workload

Gather these signals before recommending:

Traffic pattern: Steady vs bursty? Requests per second?
Current costs: Monthly Lambda spend? Existing Savings Plans?
Runtime: Node.js, Java, .NET, or Python?
Memory/CPU: How much memory? CPU-bound or I/O-bound?
Execution duration: Average and P99?
Concurrency readiness: Thread safety (Node.js/Java/.NET)? Shared /tmp paths? Per-invocation DB connections?
VPC: Already in a VPC? Private resource access needed?
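The signals above can be captured in a small structure and checked against the Quick Decision table. This is a minimal sketch, not part of the skill itself: the `WorkloadSignals` class and `lmi_fit_concerns` function are hypothetical names, and the only threshold used is the 50M requests/month figure from the table.

```python
from dataclasses import dataclass

# Runtimes listed in the Limits Quick Reference.
SUPPORTED_RUNTIMES = {"nodejs", "java", "dotnet", "python"}
# "50M+ req/mo" from the Quick Decision table.
MIN_MONTHLY_REQUESTS = 50_000_000


@dataclass
class WorkloadSignals:  # hypothetical container for the Step 1 answers
    requests_per_month: int
    steady_traffic: bool
    needs_scale_to_zero: bool
    runtime: str       # "nodejs", "java", "dotnet", or "python"
    thread_safe: bool  # only relevant for Node.js/Java/.NET


def lmi_fit_concerns(w: WorkloadSignals) -> list[str]:
    """Return reasons the workload may be a poor LMI fit (empty list = no concerns found)."""
    concerns = []
    if w.runtime not in SUPPORTED_RUNTIMES:
        concerns.append(f"runtime '{w.runtime}' is not supported")
    if not w.steady_traffic:
        concerns.append("traffic is bursty or unpredictable")
    if w.requests_per_month < MIN_MONTHLY_REQUESTS:
        concerns.append("volume is below 50M requests/month")
    if w.needs_scale_to_zero:
        concerns.append("scale-to-zero is required")
    if w.runtime != "python" and not w.thread_safe:
        concerns.append("code is not thread-safe")
    return concerns
```

An empty result only means none of the table's red flags were hit; the cost comparison in Step 2 is still required.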

Step 2: Build the Cost Comparison

REQUIRED: Present a cost comparison before recommending LMI. Compare at minimum:

Scenario	When it wins
Lambda on-demand	Low volume, bursty traffic
LMI on-demand	High volume, steady traffic

Rule of thumb: LMI becomes cost-competitive when your Lambda spend exceeds ~$1,000/month with steady traffic.

For discount analysis (Savings Plans, Reserved Instances), refer users to the AWS Pricing Calculator and references/cost-comparison.md for formulas and worked examples. Discount recommendations require workload-specific forecasting beyond this skill's scope.
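A rough sketch of the two on-demand scenarios follows. It rests on assumptions that should be checked against references/cost-comparison.md: that standard Lambda is billed per request plus per GB-second, and that LMI is billed as instance-hours plus a percentage management fee plus a per-request charge. Every price is a caller-supplied parameter; no real AWS prices are embedded, and the function names are hypothetical.

```python
HOURS_PER_MONTH = 730  # common approximation: 8,760 hours / 12


def lambda_on_demand_monthly(requests, avg_duration_s, memory_gb,
                             price_per_million_requests, price_per_gb_second):
    """Estimated monthly cost of standard Lambda (request charge + GB-seconds)."""
    request_cost = requests / 1_000_000 * price_per_million_requests
    compute_cost = requests * avg_duration_s * memory_gb * price_per_gb_second
    return request_cost + compute_cost


def lmi_on_demand_monthly(requests, instance_count, instance_hourly_price,
                          management_fee_rate, price_per_million_requests):
    """Estimated monthly LMI cost under the assumed billing structure:
    instance-hours + a management fee (fraction of instance cost) + requests."""
    instance_cost = instance_count * instance_hourly_price * HOURS_PER_MONTH
    management_fee = instance_cost * management_fee_rate
    request_cost = requests / 1_000_000 * price_per_million_requests
    return instance_cost + management_fee + request_cost


def cheaper_option(lambda_cost, lmi_cost):
    """Name the cheaper scenario, using the labels from the table above."""
    return "LMI on-demand" if lmi_cost < lambda_cost else "Lambda on-demand"
```

Pull the actual prices for the target region from the AWS Pricing Calculator before presenting numbers to a user.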

Step 3: Configure the Deployment

Instance families (~450 types): C-series (compute, .xlarge+), M-series (general, .large+), R-series (memory, .large+). ARM (Graviton) for best price-performance.

Memory-to-vCPU ratios: 2:1 (compute), 4:1 (general, default), 8:1 (memory). Min 2 GB, max 32 GB.

Default multi-concurrency per vCPU: Node.js 64, Java 32, .NET 32, Python 16.

Scaling: MinExecutionEnvironments (default 3), MaxVCpuCount (default 400), TargetResourceUtilization.

See references/configuration-guide.md for decision trees and detailed tuning.
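To see how the numbers in this step combine, here is a small sizing sketch. It assumes vCPUs are derived as memory divided by the memory-to-vCPU ratio and that maximum concurrency is vCPUs times the runtime's per-vCPU default; both are simplifications, so confirm them in references/configuration-guide.md. The function name is hypothetical.

```python
# Values taken from Step 3 above.
DEFAULT_CONCURRENCY_PER_VCPU = {"nodejs": 64, "java": 32, "dotnet": 32, "python": 16}
GB_PER_VCPU = {"compute": 2, "general": 4, "memory": 8}
MIN_MEMORY_GB, MAX_MEMORY_GB = 2, 32


def size_execution_environment(memory_gb, runtime, profile="general"):
    """Estimate vCPUs and default max concurrency for one execution environment.

    Assumption: vCPUs = memory / ratio, concurrency = vCPUs * per-vCPU default.
    """
    if not MIN_MEMORY_GB <= memory_gb <= MAX_MEMORY_GB:
        raise ValueError(f"memory must be between {MIN_MEMORY_GB} and {MAX_MEMORY_GB} GB")
    vcpus = memory_gb / GB_PER_VCPU[profile]
    max_concurrency = int(vcpus * DEFAULT_CONCURRENCY_PER_VCPU[runtime])
    return {"vcpus": vcpus, "max_concurrency": max_concurrency}
```

For Python, remember that memory is consumed per process, so the estimated concurrency also multiplies the per-process footprint.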

Step 4: Migrate the Code

Review code for concurrency safety. LMI runs multiple invocations concurrently per execution environment, but the model differs by runtime:

Python: Process-based isolation — globals are NOT shared. No thread-safety changes needed. Focus on /tmp conflicts and memory sizing (per-process × concurrency).
Node.js: Worker threads — globals shared within a worker. Requires async safety. Callback handlers not supported on Node.js 22.
Java/.NET: OS threads/Tasks — handler shared across threads. Requires full thread safety.

Common issues (all runtimes): shared /tmp paths, per-invocation DB connections. Thread-safety issues (Node.js/Java/.NET only): mutable globals, non-thread-safe libs.

See references/thread-safety.md for the review checklist and references/migration-patterns.md for runtime-specific before/after code.
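As an illustration of the shared /tmp issue, the sketch below gives each invocation its own scratch directory keyed by the request ID, then removes it when the invocation finishes. It is a minimal Python example under stated assumptions: `request_scratch_path` is a hypothetical helper, `context.aws_request_id` is the standard Lambda context attribute, and `tempfile.gettempdir()` is assumed to resolve to /tmp in the Lambda environment.

```python
import json
import os
import shutil
import tempfile


def request_scratch_path(request_id, filename, base_dir=None):
    """Return a file path inside a directory unique to this request."""
    base = base_dir or tempfile.gettempdir()  # typically /tmp on Lambda
    request_dir = os.path.join(base, f"req-{request_id}")
    os.makedirs(request_dir, exist_ok=True)
    return os.path.join(request_dir, filename)


def handler(event, context):
    # Concurrent invocations never share this path, unlike a hardcoded /tmp/payload.json.
    path = request_scratch_path(context.aws_request_id, "payload.json")
    try:
        with open(path, "w", encoding="utf-8") as f:
            json.dump(event, f)
        return {"bytes_written": os.path.getsize(path)}
    finally:
        # Clean up so /tmp usage does not grow with the number of invocations.
        shutil.rmtree(os.path.dirname(path), ignore_errors=True)
```

The same idea applies to the other runtimes; references/migration-patterns.md has the runtime-specific versions.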

Step 5: Set Up Infrastructure

Create two IAM roles: execution role (for the function) and operator role (for capacity provider EC2 management)
Configure VPC with subnets across multiple AZs (recommended 3+ for resiliency)
Create capacity provider with VPC config and scaling limits
Create or update function with capacity provider attachment
Publish a version (triggers instance provisioning)

See references/infrastructure-setup.md for CLI commands and SAM templates.

Step 6: Validate and Cut Over

Deploy to a non-production environment first
Monitor CloudWatch: CPU utilization, memory, concurrency, throttle rate
Gradual traffic shift with weighted aliases (10% → 50% → 100%)
Compare costs after 1-2 weeks of production data
Decommission standard Lambda once stable
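The gradual shift can be sketched as a list of alias updates. The helper below only builds the keyword arguments; each entry is shaped for boto3's Lambda `update_alias` call, whose `RoutingConfig.AdditionalVersionWeights` field sends a fraction of traffic to a second version. The function name, alias name, and version numbers are hypothetical.

```python
def alias_shift_plan(function_name, alias, old_version, new_version,
                     stages=(0.10, 0.50, 1.00)):
    """Build one update_alias kwargs dict per traffic-shift stage."""
    plan = []
    for weight in stages:
        if weight >= 1.0:
            # Final stage: point the alias at the new version and clear the weights.
            plan.append({
                "FunctionName": function_name,
                "Name": alias,
                "FunctionVersion": new_version,
                "RoutingConfig": {"AdditionalVersionWeights": {}},
            })
        else:
            # Intermediate stage: keep the old version primary, send `weight` to the new one.
            plan.append({
                "FunctionName": function_name,
                "Name": alias,
                "FunctionVersion": old_version,
                "RoutingConfig": {"AdditionalVersionWeights": {new_version: weight}},
            })
    return plan
```

Each step would be applied with something like `boto3.client("lambda").update_alias(**step)`, pausing between stages to watch the CloudWatch metrics listed above before moving on.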

Best Practices

Configuration

Do: Start with 4:1 ratio and runtime default concurrency
Do: Use ARM (Graviton) unless x86 dependencies exist
Do: Let Lambda choose instance types unless specific hardware needed
Do: Set MaxVCpuCount to control cost ceiling
Don't: Set MinExecutionEnvironments below 3 in production (reduces multi-AZ coverage). Non-prod environments can use 1 as the minimum.
Don't: Over-restrict instance types (lowers availability)

Migration

Do: Start with I/O-heavy functions (they benefit most from multi-concurrency; CPU-bound functions compete for the same CPU)
Do: Review code for concurrency safety before attaching to capacity provider (thread safety for Node.js/Java/.NET; /tmp and memory for Python)
Do: Use weighted aliases for gradual traffic shift
Do: Include request IDs in all log statements
Do: Initialize DB pools and SDK clients outside the handler
Do: Estimate total /tmp usage under max concurrency
Don't: Write to hardcoded /tmp paths without request-unique naming
Don't: Skip cost comparison — LMI is not always cheaper
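One way to follow the request-ID logging practice in Python is a `logging.LoggerAdapter` that prefixes every message, so interleaved log lines from concurrent invocations can be told apart. This is a sketch; the logger name and the `RequestIdAdapter` class are hypothetical, and Powertools or another structured logger may be a better fit in practice.

```python
import logging

# Created once per process, outside the handler.
logger = logging.getLogger("lmi-example")


class RequestIdAdapter(logging.LoggerAdapter):
    """Prefix each log message with the current invocation's request ID."""

    def process(self, msg, kwargs):
        return f"[{self.extra['request_id']}] {msg}", kwargs


def handler(event, context):
    # Bind the request ID once per invocation; every log call below carries it.
    log = RequestIdAdapter(logger, {"request_id": context.aws_request_id})
    log.info("processing %d records", len(event.get("records", [])))
    return {"request_id": context.aws_request_id}
```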

Operations

Do: Set CloudWatch alarms on throttle rate > 1% and CPU > 80%
Don't: Manually terminate LMI EC2 instances (delete the capacity provider instead)
Don't: Forget to publish a version — unpublished functions cannot run on LMI
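The alarm thresholds above reduce to a simple check. The sketch below assumes throttle rate is computed as throttles divided by total attempts (invocations plus throttles), since throttled requests are not counted as invocations; the function name is hypothetical, and the real alarms would be defined in CloudWatch rather than in code.

```python
def breached_thresholds(invocations, throttles, cpu_percent,
                        max_throttle_rate=0.01, max_cpu_percent=80.0):
    """Return which of the two recommended alarm thresholds are exceeded."""
    attempts = invocations + throttles
    # Assumption: rate = throttles / (invocations + throttles); 0 when there is no traffic.
    throttle_rate = throttles / attempts if attempts else 0.0
    breaches = []
    if throttle_rate > max_throttle_rate:
        breaches.append("throttle_rate")
    if cpu_percent > max_cpu_percent:
        breaches.append("cpu")
    return breaches
```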

Limits Quick Reference

Resource	Limit
Memory	2 GB min, 32 GB max
Concurrency/vCPU	64 (Node.js), 32 (Java/.NET), 16 (Python)
Instance lifespan	~12 hours (auto-replaced by Lambda)
EE lifespan	~4 hours (auto-replaced by Lambda)
Runtimes	Node.js, Java, .NET, Python
Instance families	C (.xlarge+), M (.large+), R (.large+)
Scaling	Absorbs traffic that doubles within 5 min without throttles

Troubleshooting Quick Reference

Issue	Cause	Fix
429 throttles	Traffic exceeds scaling speed	Increase MinExecutionEnvironments or lower TargetResourceUtilization
Function stuck PENDING	Provisioning instances	Wait; check VPC/IAM config
Architecture mismatch	Function ≠ capacity provider arch	Align both to same architecture
Cannot terminate instances	Managed by capacity provider	Delete capacity provider instead
Race conditions	Code not thread-safe	See references/thread-safety.md

See references/troubleshooting.md for detailed resolution steps.

Configuration

AWS CLI Setup

REQUIRED: AWS credentials configured on the host machine.

Verify access: Run aws sts get-caller-identity

Regional Availability

Currently available: us-east-1, us-east-2, us-west-2, ap-northeast-1, eu-west-1. Expanding to all commercial regions soon.

Check the Lambda Managed Instances documentation for the latest regional availability.

Language Selection

Default: TypeScript

Override: "use Python" → Python, "use JavaScript" → JavaScript. When not specified, ALWAYS use TypeScript.

IaC Framework Selection

Default: CDK

Override: "use SAM" → SAM YAML, "use CloudFormation" → CloudFormation YAML. When not specified, ALWAYS use CDK.
