ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
151 skills found
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Teacher-focused student profiling tool: OCR answer sheets, summarize performance, and update student profiles with targeted physics learning goals.
Comprehensive Python healthcare AI toolkit for clinical data processing, medical coding translation, and developing deep learning models like RETAIN and Transformers for EHR, physiological signals, and clinical prediction tasks.
Automated retrieval of PubMed scientific literature and generation of plain-language biomedical research summaries.
Analyze and audit React projects for security, performance, correctness, and architecture issues with actionable diagnostics and scoring.
Anthropic Claude integration patterns: streaming, RAG with pgvector, tool use, model selection (Haiku/Sonnet/Opus), prompt caching, and cost management for AI-powered engineering.
End-to-end startup idea validation using S.E.E.D. niche checks, STREAM 6-layer analysis, and Devil's Advocate inversion to generate PRDs.
Automate clinical report generation including CARE-compliant case reports, diagnostic summaries, clinical trial documentation (CSR/SAE), and patient notes with regulatory compliance.
Convert clinical text to natural, empathetic speech using ElevenLabs for patient instructions, medication reminders, and accessible health content.
Enforces a strict evidence-based debugging workflow using structured observation, hypothesis testing, and causality validation to eliminate speculation in technical investigations.
Physical hardware synthesis bridge for PAI. Generates blueprints, 3D printing code, SVG paths for laser cutting, and G-Code for CNC machining to bring agentic designs into the physical world.
Accelerate clinical and healthcare app development in Lovable. Perfect for OpenClaw Clinical Hackathon participants building MVPs with PHI-safe patterns.