Maintainer shelf

Orchestra Research

Reusable agent skills published under this GitHub owner, sorted by stars, with aggregate signals below.

62 skills · 319K GitHub stars · 0 weekly installs

Strongest repo by stars

orchestra-research/ai-research-skills

62 skills · 319K stars total

62 skills

crewai

This skill helps you orchestrate teams of autonomous agents for complex tasks with memory, roles, and production-ready workflows.

Api · Automation · Backend · Data · +3

vllm

This skill helps deploy high-throughput LLM serving with vLLM, enabling OpenAI-compatible endpoints, quantization, and tensor parallelism for production

Api · Backend · Devops · Infra · +2

nemo-guardrails

This skill enforces runtime safety for LLMs with configurable rails for jailbreak detection, toxicity, PII, and fact-checking to improve reliability.

Automation · Backend · Devops · Observability · +3

audiocraft

This skill helps you generate music or sounds from text descriptions using AudioCraft, enabling melody-conditioned and stereo audio output.

Api · Automation · Content · Docs · +2

stable-diffusion

This skill helps you generate high-quality images from text prompts, perform image-to-image tasks, and optimize diffusion workflows with Stable Diffusion.

Api · Automation · Docs · Productivity · +2

gptq

This skill helps you compress large language models to 4-bit precision with minimal accuracy loss, enabling faster inference and smaller memory footprints.

Performance · Research · Tex

trl-fine-tuning

This skill guides fine-tuning LLMs with TRL for instruction tuning, preference alignment, and reward-based optimization, aligning models to human feedback.

Automation · Data · Productivity · Research · +2

skypilot

This skill helps orchestrate ML workloads across multiple clouds with automatic cost optimization and spot instance recovery.

Analytics · Automation · Cloud · Tex

nemo-curator

This skill optimizes LLM data curation with GPU-accelerated, multi-modal cleaning, deduplication, and PII redaction to improve training data quality.

Analytics · Automation · Cloud · Data · +3

pytorch-lightning

This skill helps you streamline PyTorch Lightning training, automate distributed execution, and reduce boilerplate for scalable, reproducible experiments.

Automation · Data · Debugging · Docs · +3

constitutional-ai

This skill helps you align models for safety using self-critique and AI feedback, reducing harmful outputs without human labeling.

Automation · Code Review · Docs · Research · +2

ray-train

This skill orchestrates distributed training with Ray Train to scale PyTorch, TensorFlow, and HuggingFace workloads across clusters, boosting efficiency and fault tolerance.

Analytics · Cloud · Devops · Kubernetes · +2

axolotl

This skill provides expert guidance for fine-tuning LLMs with Axolotl, including YAML configs, 100+ models, and multimodal support.

Devops · Research · Tex

llama-factory

This skill provides expert guidance for fine-tuning LLaMA models with Llama-Factory, covering APIs, setup, and best practices for multimodal, 8-bit QLoRA

Api · Debugging · Docs · Scripting · +1

faiss

This skill enables fast billion-scale vector similarity with FAISS, guiding deployment, index selection, and GPU-accelerated search for high-performance

Analytics · Backend · Database · Performance · +1

unsloth

This skill provides expert guidance for fast fine-tuning with Unsloth, enabling 2-5x training speed and reduced memory usage.

Automation · Debugging · Performance · Research · +1

guidance

This skill helps you enforce structured generation with regex and grammars, guaranteeing valid JSON/XML/code and guiding multi-step workflows.

Debugging · Docs · Planning · Productivity · +3

mamba

This skill helps you deploy and experiment with Mamba selective state-space models for efficient linear-time sequence processing on GPUs.

Performance · Research · Tex

blip-2

This skill helps you perform vision-language tasks such as captioning, VQA, and multimodal chat using BLIP-2 with frozen encoders.

Analytics · Data · Research · Tex

saelens

This skill helps you train and analyze Sparse Autoencoders with SAELens to extract interpretable, monosemantic features from neural activations.

Analytics · Productivity · Research · Tex

dspy

This skill helps you build complex AI systems with declarative LM programming, automatic prompt optimization and modular RAG pipelines for reliable outputs.

Automation · Data · Devops · Docs · +2

sentence-transformers

This skill helps generate high-quality embeddings for semantic search and retrieval using sentence-transformers, enabling efficient RAG, clustering, and

Analytics · Data · Performance · Productivity · +2

instructor

This skill extracts and validates structured data from LLM responses using Pydantic, with automatic retries and real-time streaming.

Api · Automation · Data · Tex

nanogpt

This skill helps you learn transformer basics by guiding you through a nanoGPT-style GPT-2 reproduction, training, and experimentation for educational purposes.

Data · Docs · Research · Tex

accelerate

This skill simplifies distributed training with HuggingFace Accelerate, enabling seamless multi-GPU/TPU setups via a four-line integration.

Automation · Docs · Performance · Scripting · +1

lambda-labs

This skill helps you manage Lambda Labs GPU Cloud resources for scalable ML training and inference with persistent storage and easy SSH access.

Automation · Cloud · Infra · Performance · +2

openrlhf

This skill speeds up high-performance RLHF training for large models with Ray and vLLM acceleration, simplifying distributed PPO, GRPO, and DPO workflows

Cli · Debugging · Performance · Productivity · +3

peft

This skill enables memory-efficient fine-tuning of large language models using LoRA, QLoRA, and adapters to save GPU memory.

Docs · Performance · Python · Research · +1

brainstorming-research-ideas

This skill guides researchers through structured ideation frameworks to uncover high-impact research directions, offering actionable prompts and evaluation

Analytics · Docs · Planning · Strategy · +1

20-ml-paper-writing

This skill helps you draft publication-ready ML papers for top conferences by providing proactive drafting, citation verification, LaTeX templates, and

Docs · Planning · Productivity · Writing · +1

llamaindex

This skill helps you build powerful RAG applications by ingesting documents, indexing data, and querying with LlamaIndex.

Analytics · Data · Docs · Productivity · +1

openpi

This skill helps you fine-tune and deploy OpenPI pi0, pi0-fast, or pi0.5 models for robot policy inference across ALOHA, DROID, and LIBERO.

Api · Debugging · Devops · Scripting · +1

cosmos-policy

This skill evaluates NVIDIA Cosmos Policy on LIBERO and RoboCasa simulations, enabling efficient setup, headless rendering, and latency profiling for robotics

Debugging · Performance · Python · Testing · +1

openvla-oft

This skill fine-tunes and evaluates OpenVLA-OFT policies for robot action generation with LoRA and FiLM conditioning.

Automation · Debugging · Docker · Python · +2

0-autoresearch-skill

This skill automates end-to-end AI research projects by managing loops, literature search, experiments, and synthesis to guide direction and produce papers.

Automation · Productivity · Research · Writing · +1

creative-thinking-for-research

This skill helps researchers generate genuinely novel CS and AI ideas by applying cognitive science frameworks like combinatorial creativity and constraint

Data · Planning · Research · Strategy · +1

prompt-guard

This skill detects prompt injections and jailbreak attempts in LLM apps, ensuring safer interactions and reliable third-party data filtering.

Api · Backend · Data · Security · +2

pytorch-fsdp2

This skill helps you integrate PyTorch FSDP2 into training scripts with correct initialization, sharding, mixed precision, and DTensor-based checkpointing.

Debugging · Devops · Observability · Performance · +2

verl

This skill guides reinforcement-learning-based training of large language models using verl across PPO, GRPO, and other RL algorithms.

Backend · Productivity · Research · Tex

miles

This skill guides enterprise RL training with miles for large MoE models, enabling FP8/INT4, train-inference alignment, and speculative RL for throughput.

Automation · Data · Performance · Research · +1

slime

This skill helps you accelerate RL-based LLM post-training with slime, using Megatron-LM and SGLang for scalable data generation and rollout.

Data · Monitoring · Observability · Productivity · +2

torchtitan

This skill enables scalable pretraining of large language models using PyTorch Torchtitan 4D parallelism across GPUs, delivering faster training with efficient

Automation · Cloud · Docs · Performance · +1

nemo-evaluator

This skill helps you benchmark LLMs across 100+ benchmarks with containerized, scalable evaluation on local Docker, Slurm HPC, or cloud platforms.

Analytics · Automation · Cloud · Devops · +2

gguf

This skill helps you deploy AI models efficiently on consumer hardware using GGUF quantization for flexible 2-8 bit inference.

Backend · Devops · Infra · Performance · +1

phoenix

This skill helps you instrument, trace, evaluate, and monitor LLM applications with Phoenix for debugging, testing, and real-time observability.

Api · Backend · Data · Database · +4

deepspeed

This skill provides expert guidance for distributed training with DeepSpeed, covering ZeRO, pipeline parallelism, FP16/BF16/FP8, and optimization best

Debugging · Performance · Productivity · Research · +1

bigcode-evaluation-harness

This skill benchmarks code generation models across 15+ tasks, providing pass@k metrics and multi-language evaluation for robust code quality.

Analytics · Data · Research · Testing · +1

long-context

This skill helps extend transformer context windows for long documents using RoPE, YaRN, ALiBi, and position interpolation to improve efficiency and

Analytics · Performance · Research · Tex

weights-and-biases

This skill helps you track ML experiments, visualize training, sweep hyperparameters, and manage models using Weights & Biases for streamlined MLOps.

Analytics · Automation · Data · Devops · +2

tensorrt-llm

This skill optimizes LLM inference on NVIDIA GPUs with TensorRT-LLM for maximum throughput and lowest latency in production.

Backend · Cloud · Infra · Performance · +1

llama-cpp

This skill runs LLM inference efficiently on CPU and non-NVIDIA hardware, enabling edge deployment and Apple Silicon performance with GGUF quantization.

Backend · Infra · Performance · Tex

sentencepiece

This skill helps you implement language-independent tokenization with SentencePiece to support multilingual models and reproducible vocabularies.

Analytics · Data · Product · Research · +2

chroma

This skill helps you implement open-source embedding storage and semantic search for AI apps with RAG workflows using Chroma.

Analytics · Backend · Data · Database · +2

litgpt

This skill helps you implement and train LLMs with LitGPT across 20+ pretrained architectures for clean, production-ready workflows.

Automation · Backend · Cli · Devops · +3

bitsandbytes

This skill helps you quantize large language models to 8-bit or 4-bit with minimal accuracy loss to reduce memory and speed up inference.

Automation · Performance · Productivity · Research · +1

flash-attention

This skill accelerates transformer attention with Flash Attention for 2-4x speedup and 10-20x memory reduction on long sequences.

Automation · Debugging · Infra · Performance · +1

pinecone

This skill helps you manage production-grade vector search with Pinecone, delivering low-latency, serverless indexing and hybrid search capabilities.

Analytics · Cloud · Database · Devops · +3

megatron-core

This skill helps you optimize large-scale LLM training with Megatron-Core, enabling efficient 2B-462B parameter models using advanced parallelism.

Infra · Performance · Research · Scripting · +1

speculative-decoding

This skill accelerates LLM inference using speculative decoding, Medusa heads, and lookahead techniques to boost speed and reduce latency.

Backend · Performance · Research · Tex

tensorboard

This skill helps you visualize training metrics, debug models, compare experiments, and profile performance with TensorBoard.

Analytics · Debugging · Observability · Performance · +1

moe-training

This skill helps you train large-scale Mixture of Experts models with DeepSpeed or HuggingFace efficiently, reducing compute while expanding capacity.

Performance · Productivity · Research · Scripting · +1

pytorch-fsdp

This skill provides expert guidance for Fully Sharded Data Parallel training with PyTorch FSDP, covering sharding, mixed precision, and offloading.

Code Review · Debugging · Docs · Performance · +2
