ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs using long context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
411 skills found
Analyze local system hardware (RAM, CPU, GPU/VRAM) to receive expert recommendations for optimized local LLMs, quantization settings, and performance estimates.
SwiftUI architecture and implementation patterns for native iOS and macOS development, focusing on state management, view composition, and data persistence.
Search the live web using Baidu AI Search Engine (BDSE) for real-time information, documentation, and research topics.
Create, alter, and validate Snowflake semantic views using the Snowflake CLI.
Search the web for real-time data and research using the Turing Tavily proxy. Use for up-to-date information, current events, and web-based research tasks.
Expert LangGraph architect skill for designing stateful, multi-actor AI agent workflows with robust persistence, conditional branching, and ReAct patterns.
Python coding assistant providing best practices, PEP 8 enforcement, automated testing with pytest, and dependency management using uv.
Implement secure backend authentication (JWT, OAuth, Sessions) and authorization (RBAC, ABAC) patterns, including password hashing, MFA, and security best practices.
Build and execute state-machine based automations with human-in-the-loop support for complex, multi-step business processes.
Build a cohesive, constraint-based design system using the Design Graph methodology. Automate the creation of design tokens, typography scales, components, variants, and themes.
Autonomous multi-agent LinkedIn system using LangGraph and Claude Opus 4.5 for trend research, content creation, voice profiling, and analytics-driven optimization.