تشغيل أي مهارة في Manus بنقرة واحدة

ops-verify-jobs

النجوم٠

التفرعات١

آخر تحديث٢٨ مايو ٢٠٢٦ في ١٢:٤٥

Verify that Solid Queue background jobs are running properly. Checks worker health, recurring job execution, and optionally triggers a test job to prove end-to-end processing. Supports local, staging, and production targets.

التثبيت

التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.

تشغيل في Manus

المصدر

CodySwannGT

CodySwannGT/railsstarter

فتح مستودع GitHub عرض مستودعات المنشئ

تنزيل

تشغيل في Manus

المهن ذات الصلةSOC

استنادا إلى تصنيف SOC المهني

مطوّرو البرمجياتمهن الحاسوب والرياضيات·SOC 15-1252

SKILL.md

readonly

name	ops-verify-jobs
description	Verify that Solid Queue background jobs are running properly. Checks worker health, recurring job execution, and optionally triggers a test job to prove end-to-end processing. Supports local, staging, and production targets.
allowed-tools	["Bash","Read"]

Verify Background Jobs

Verify that Solid Queue background jobs are running and processing correctly in the specified environment.

Target environment: $ARGUMENTS (expected: local, staging, or production; default: staging)

Workflow

Step 1: Set Environment Variables

Environment	Profile	Cluster	Worker Service	Worker Log Group
local	—	—	`worker` (Docker)	Docker Compose logs
staging	`your-project-staging`	`webCluster`	`worker-service`	Discover via `aws logs describe-log-groups`
production	`your-project-production`	`webCluster`	`worker-service`	Discover via `aws logs describe-log-groups`

Step 2: Check Worker Service Health

If local

docker compose ps worker

Confirm the worker container is running and healthy.

If staging or production

Check AWS session:
```
aws sts get-caller-identity --profile <profile>
```
If expired: aws sso login --profile <profile>

Check ECS service status:

aws ecs describe-services \
  --cluster webCluster \
  --services worker-service \
  --profile <profile> \
  --region us-east-1 \
  --query 'services[].{name:serviceName,status:status,running:runningCount,desired:desiredCount,rollout:deployments[0].rolloutState}' \
  --output table

Verify: running >= 1, status = ACTIVE, rollout = COMPLETED.

Step 3: Verify Recurring Jobs Are Executing

If local

docker compose logs --tail=100 worker 2>&1 | grep -E "(heartbeat|PublishCloudWatchMetrics|SolidQueue::RecurringJob)"

If staging or production

Discover log groups:

aws logs describe-log-groups \
  --profile <profile> \
  --region us-east-1 \
  --query 'logGroups[].logGroupName' \
  --output table

Search for recurring job execution in the worker log group (last 15 minutes):

aws logs filter-log-events \
  --log-group-name <worker-log-group> \
  --filter-pattern "Performing" \
  --start-time $(date -v-15M +%s000) \
  --profile <profile> \
  --region us-east-1 \
  --query 'events[].message' \
  --output text

Check for job failures:

aws logs filter-log-events \
  --log-group-name <worker-log-group> \
  --filter-pattern "?Failed ?\"Error performing\"" \
  --start-time $(date -v-15M +%s000) \
  --profile <profile> \
  --region us-east-1 \
  --query 'events[].message' \
  --output text

Step 4: Trigger a Test Job (End-to-End Proof)

This step enqueues a VerifyJobExecutionJob with a unique marker, then searches worker logs for that marker to prove the full pipeline works: enqueue -> pickup -> execute -> log.

If local

MARKER="verify-$(openssl rand -hex 8)"
echo "Marker: $MARKER"

# Enqueue the job
docker compose run --rm web bin/rails runner "VerifyJobExecutionJob.perform_later('$MARKER')"

# Wait for the worker to process it
sleep 5

# Search for the marker in worker logs
docker compose logs --tail=50 worker 2>&1 | grep "$MARKER"

If staging or production

Generate a unique marker:

MARKER="verify-$(openssl rand -hex 8)"
echo "Marker: $MARKER"

Get a running web task for ECS exec:

TASK_ARN=$(aws ecs list-tasks \
  --cluster webCluster \
  --service-name web-rails-service \
  --profile <profile> \
  --query 'taskArns[0]' \
  --output text)

TASK_ID=$(echo "$TASK_ARN" | awk -F/ '{print $NF}')

CONTAINER_NAME=$(aws ecs describe-tasks \
  --cluster webCluster \
  --tasks "$TASK_ID" \
  --profile <profile> \
  --query 'tasks[0].containers[?starts_with(name, `ecs-service-connect`) == `false`].name' \
  --output text | awk '{print $1}')

Enqueue the test job via ECS exec:

aws ecs execute-command \
  --cluster webCluster \
  --task "$TASK_ID" \
  --container "$CONTAINER_NAME" \
  --interactive \
  --command "bin/rails runner \"VerifyJobExecutionJob.perform_later('$MARKER')\"" \
  --profile <profile>

Wait for the worker to process it (30 seconds should be more than enough):
```
sleep 30
```

Search worker logs for the marker:

aws logs filter-log-events \
  --log-group-name <worker-log-group> \
  --filter-pattern "$MARKER" \
  --start-time $(date -v-2M +%s000) \
  --profile <profile> \
  --region us-east-1 \
  --query 'events[].message' \
  --output text

Evaluate result:
- If the marker appears with status=completed: PASS — jobs are being enqueued and processed end-to-end
- If the marker does not appear: FAIL — the worker is not picking up new jobs. Check worker service health, logs for errors, and SolidQueue dispatcher status

Step 5: Report Results

Summarize findings in a table:

Check	Result
Worker service running	Yes/No (count, status)
Recurring jobs executing	Yes/No (list which ones, any gaps)
Job failures in last 15 min	Count (list if any)
Test job (end-to-end)	PASS/FAIL (marker, timing)
Non-job errors (OTEL, etc.)	Note any noise

If any check fails, provide the specific error output and a recommended next step.

Execution

Verify jobs now for the specified environment.

المزيد من هذا المستودع

نفس المستودع

ops-check-logs

CodySwannGT/railsstarter

Check application logs from local Docker Compose or remote AWS CloudWatch environments. Supports local, staging, and production targets.

2026-05-280

ops-deploy

CodySwannGT/railsstarter

Deploy the Your Project Rails application to staging or production. Supports local builds via bin/deploy-staging and CI/CD via branch pushes.

2026-05-280

ops-run-local

CodySwannGT/railsstarter

Manage the local Docker Compose development environment. Supports start, stop, restart, and status operations.

2026-05-280

ops-verify-telemetry

CodySwannGT/railsstarter

Verify that OpenTelemetry traces are being collected and exported to X-Ray. Check trace health, find slow requests, investigate errors, and view service dependencies.

2026-05-280

action-mailer-best-practices

CodySwannGT/railsstarter

Best practices for Rails Action Mailer. Use when writing new mailers, refactoring existing mailers, or when a mailer has inline business logic, missing previews, synchronous delivery, or mixed responsibilities. Applies patterns - single-responsibility mailers, parameterized mailers, deliver_later by default, mailer concerns, service delegation, previews, and structured testing.

2026-03-110

active-job-best-practices

CodySwannGT/railsstarter

Best practices for Rails Active Job with Solid Queue. Use when writing new background jobs, refactoring existing jobs, or when a job has mixed responsibilities, inline business logic, non-idempotent design, or missing error handling. Applies patterns - single-responsibility jobs, argument serialization, idempotent design, retry/discard strategies, queue management, recurring schedules, job concerns, and service delegation.

2026-03-110

name	ops-verify-jobs
description	Verify that Solid Queue background jobs are running properly. Checks worker health, recurring job execution, and optionally triggers a test job to prove end-to-end processing. Supports local, staging, and production targets.
allowed-tools	["Bash","Read"]

Verify Background Jobs

Verify that Solid Queue background jobs are running and processing correctly in the specified environment.

Target environment: $ARGUMENTS (expected: local, staging, or production; default: staging)

Workflow

Step 1: Set Environment Variables

Environment	Profile	Cluster	Worker Service	Worker Log Group
local	—	—	`worker` (Docker)	Docker Compose logs
staging	`your-project-staging`	`webCluster`	`worker-service`	Discover via `aws logs describe-log-groups`
production	`your-project-production`	`webCluster`	`worker-service`	Discover via `aws logs describe-log-groups`

Step 2: Check Worker Service Health

If local

docker compose ps worker

Confirm the worker container is running and healthy.

If staging or production

Check AWS session:
```
aws sts get-caller-identity --profile <profile>
```
If expired: aws sso login --profile <profile>

Check ECS service status:

aws ecs describe-services \
  --cluster webCluster \
  --services worker-service \
  --profile <profile> \
  --region us-east-1 \
  --query 'services[].{name:serviceName,status:status,running:runningCount,desired:desiredCount,rollout:deployments[0].rolloutState}' \
  --output table

Verify: running >= 1, status = ACTIVE, rollout = COMPLETED.

Step 3: Verify Recurring Jobs Are Executing

If local

docker compose logs --tail=100 worker 2>&1 | grep -E "(heartbeat|PublishCloudWatchMetrics|SolidQueue::RecurringJob)"

If staging or production

Discover log groups:

aws logs describe-log-groups \
  --profile <profile> \
  --region us-east-1 \
  --query 'logGroups[].logGroupName' \
  --output table

Search for recurring job execution in the worker log group (last 15 minutes):

aws logs filter-log-events \
  --log-group-name <worker-log-group> \
  --filter-pattern "Performing" \
  --start-time $(date -v-15M +%s000) \
  --profile <profile> \
  --region us-east-1 \
  --query 'events[].message' \
  --output text

Check for job failures:

aws logs filter-log-events \
  --log-group-name <worker-log-group> \
  --filter-pattern "?Failed ?\"Error performing\"" \
  --start-time $(date -v-15M +%s000) \
  --profile <profile> \
  --region us-east-1 \
  --query 'events[].message' \
  --output text

Step 4: Trigger a Test Job (End-to-End Proof)

This step enqueues a VerifyJobExecutionJob with a unique marker, then searches worker logs for that marker to prove the full pipeline works: enqueue -> pickup -> execute -> log.

If local

MARKER="verify-$(openssl rand -hex 8)"
echo "Marker: $MARKER"

# Enqueue the job
docker compose run --rm web bin/rails runner "VerifyJobExecutionJob.perform_later('$MARKER')"

# Wait for the worker to process it
sleep 5

# Search for the marker in worker logs
docker compose logs --tail=50 worker 2>&1 | grep "$MARKER"

If staging or production

Generate a unique marker:

MARKER="verify-$(openssl rand -hex 8)"
echo "Marker: $MARKER"

Get a running web task for ECS exec:

TASK_ARN=$(aws ecs list-tasks \
  --cluster webCluster \
  --service-name web-rails-service \
  --profile <profile> \
  --query 'taskArns[0]' \
  --output text)

TASK_ID=$(echo "$TASK_ARN" | awk -F/ '{print $NF}')

CONTAINER_NAME=$(aws ecs describe-tasks \
  --cluster webCluster \
  --tasks "$TASK_ID" \
  --profile <profile> \
  --query 'tasks[0].containers[?starts_with(name, `ecs-service-connect`) == `false`].name' \
  --output text | awk '{print $1}')

Enqueue the test job via ECS exec:

aws ecs execute-command \
  --cluster webCluster \
  --task "$TASK_ID" \
  --container "$CONTAINER_NAME" \
  --interactive \
  --command "bin/rails runner \"VerifyJobExecutionJob.perform_later('$MARKER')\"" \
  --profile <profile>

Wait for the worker to process it (30 seconds should be more than enough):
```
sleep 30
```

Search worker logs for the marker:

aws logs filter-log-events \
  --log-group-name <worker-log-group> \
  --filter-pattern "$MARKER" \
  --start-time $(date -v-2M +%s000) \
  --profile <profile> \
  --region us-east-1 \
  --query 'events[].message' \
  --output text

Evaluate result:
- If the marker appears with status=completed: PASS — jobs are being enqueued and processed end-to-end
- If the marker does not appear: FAIL — the worker is not picking up new jobs. Check worker service health, logs for errors, and SolidQueue dispatcher status

Step 5: Report Results

Summarize findings in a table:

Check	Result
Worker service running	Yes/No (count, status)
Recurring jobs executing	Yes/No (list which ones, any gaps)
Job failures in last 15 min	Count (list if any)
Test job (end-to-end)	PASS/FAIL (marker, timing)
Non-job errors (OTEL, etc.)	Note any noise

If any check fails, provide the specific error output and a recommended next step.

Execution

Verify jobs now for the specified environment.