Free SKILL.md scraped from GitHub. Clone the repo or copy the file directly into your Claude Code skills directory.

- `npx versuz@latest install hiyenwong-ai-collection-collection-skills-evaluating-large-language-models-trained-on-co`
- `git clone https://github.com/hiyenwong/ai_collection.git`
- `cp ai_collection/SKILL.MD ~/.claude/skills/hiyenwong-ai-collection-collection-skills-evaluating-large-language-models-trained-on-co/SKILL.md`

---
name: evaluating-large-language-models-trained-on-code
description: Skill for AI agent capabilities
---

# evaluating-large-language-models-trained-on-code - Evaluating large language models trained on code

## Description

**Source:** https://openai.com/index/evaluating-large-language-models-trained-on-code

**Date:** Wed, 07 Jul 2021 07:00:00 GMT

**Category:** OpenAI Research

## Activation Keywords

- evaluating large language models trained on code
- openai evaluating-large-language-models-trained-on-code

## Core Concepts

### Key Points

- Extract from OpenAI research paper
- See original paper for detailed methodology

## Step-by-Step Instructions

### 1. Background

```python
# Research background
# See original paper: https://openai.com/index/evaluating-large-language-models-trained-on-code
```

### 2. Implementation

```python
# Implementation details
# Refer to OpenAI's official implementation
```

## Tools Used

- `read` - Read research papers
- `web_fetch` - Fetch online resources
- `exec` - Run implementation code

## Example Use Cases

### 1. Basic Usage

```python
# Example usage based on research
```

## Instructions for Agents

Follow these steps when applying this skill:

### Step 1: Background

## Examples

### Example 1: Basic Application

**User:** I need to apply evaluating-large-language-models-trained-on-code - Evaluating large language models trained on code to my analysis.

**Agent:** I'll help you apply evaluating-large-language-models-trained-on-code. First, let me understand your specific use case...

**Context:** Apply the methodology

### Example 2: Advanced Scenario

**User:** Complex analysis scenario

**Agent:** Based on the methodology, I'll guide you through the advanced application...

### Example 3: Advanced Application

**User:** What are the key considerations for evaluating-large-language-models-trained-on-code?

**Agent:** Let me search for the latest research and best practices...

## Related Skills

- Other OpenAI research skills

## References

- https://openai.com/index/evaluating-large-language-models-trained-on-code

---

**Created:** 2026-03-29 14:25

**Author:** Aerial (from OpenAI Research)
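
As a supplement to the Implementation step, which defers to the original paper: that paper scores code models by functional correctness using the pass@k metric, estimated per problem as 1 - C(n-c, k) / C(n, k) from n generated samples of which c pass the unit tests. The sketch below is a minimal illustration of that estimator, not OpenAI's official implementation; the function name `pass_at_k` and the `results` data are hypothetical and chosen only for this example.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem as 1 - C(n-c, k) / C(n, k).

    n: samples generated, c: samples that passed the tests, k: budget.
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) is 0 when fewer than k samples failed, so the
    # estimate is exactly 1.0 in that case.
    return 1.0 - comb(n - c, k) / comb(n, k)


# Hypothetical per-problem results: (samples drawn, samples passing).
results = [(10, 3), (10, 0), (10, 10)]
k = 1

# The benchmark-level score is the mean of the per-problem estimates.
score = sum(pass_at_k(n, c, k) for n, c in results) / len(results)
print(f"pass@{k} = {score:.3f}")
```

Exact integer binomials from `math.comb` keep the sketch dependency-free; for very large n, a running-product form of the same ratio avoids building huge intermediate integers.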