z-card-image
Automated text-to-image rendering engine for social media posts, article covers, and long-form threads. Supports X-style, WeChat, and poster templates with high-precision text formatting and highlights.
154 skills found
Bayesian modeling and probabilistic programming with PyMC. Build hierarchical models, perform MCMC sampling (NUTS), variational inference, and conduct rigorous model comparison using LOO and WAIC.
Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.
Generate real-time AI podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model with WebSocket streaming, complete with PCM-to-WAV conversion and frontend playback integration.
Implement Google Gemini API vision capabilities for image and document analysis, including captioning, object detection, segmentation, and multi-image comparison.
Production-ready audio/video transcription using OpenAI Whisper. Features model selection, timing synchronization, speaker diarization, and batch processing for media workflows.
Classical machine learning with scikit-learn. Use for classification, regression, clustering, dimensionality reduction, preprocessing, model evaluation, and building robust ML pipelines in Python.
Transcribe audio files (wav, mp3, ogg) to text using the Qwen ASR model. Fast, local-friendly, and requires no API keys.
Remove AI-generated patterns and inject natural human voice into your writing. Fixes robotic phrasing, overuse of AI vocabulary, and sterile structure to make text sound authentic.
Specialized IDF (Information Display Frame) sub-agent for generating and reviewing CQRS Query Side implementations across Java, TypeScript, and Go.
Generates a random lucky number between 0 and 9999 for games, decision-making, or entertainment.
Generates data cleaning pipelines for pandas/polars/PySpark, handling missing values, duplicates, outliers, type conversions, and validation.