paper-reproduce
Systematic methodology for reproducing published academic papers using provided data, including sample selection, statistical verification, and automated reporting.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
Production-ready audio/video transcription using OpenAI Whisper. Features model selection, timing synchronization, speaker diarization, and batch processing for media workflows.
Automated single-cell RNA-seq quality control pipeline following scverse best practices. Performs MAD-based outlier detection, cell filtering, and diagnostic visualization for .h5ad and .h5 datasets.
Normalizes testing defect logs by correcting typos, abbreviations, and ambiguous descriptions based on product-specific codebooks and station validation.
Convert markdown PRDs into structured prd.json files for the Ralph autonomous AI agent system to enable repeatable, context-aware software development.
Find, review, and remove duplicate or near-duplicate images in FiftyOne datasets using computer vision similarity embeddings.
Specialized data engineering agent for designing ETL/ELT pipelines, defining data schemas, managing data quality, and implementing robust ingestion workflows.
High-performance document intelligence library for extracting text, tables, code, and metadata from 91+ file formats, with OCR and LLM-ready output.
Implement robust server-side and client-side input validation using sanitization and allowlists to prevent injection attacks and ensure data integrity.
Patterns for extracting, caching, and processing analytics data from Google Analytics 4 (GA4) and Google Search Console (GSC) using MCP servers.
Automates the release preparation process for MassGen by generating CHANGELOG entries, creating announcement drafts, and validating documentation integrity before git tagging.
Implement LlamaExtract for robust structured data extraction from PDF, DOCX, and PPTX files using Pydantic schemas.