---
name: mssql-bulk-data-operations
type: guidance
applies_to:
- Developer
- DBA
mandatory: conditional
triggers:
- bulk update
- bulk insert
- bulk delete
- update large dataset
- update millions of records
- insert millions of records
- batch update
- batch insert
- large data operation
- update 3M records
- add 5M records
- mass update
- mass insert
references:
- templates/batch-insert.sql
- templates/batch-update.sql
summary: Generates production-ready batched T-SQL scripts for large-scale INSERT, UPDATE, or DELETE operations on MSSQL with progress tracking, checkpointing, and transaction safety.
---
# Bulk Data Operations
## Purpose
Generates production-ready T-SQL scripts for processing large datasets (millions of rows) in safe, resumable batches with real-time progress reporting.
## When to Use
Use this when the user asks to:
- Update, insert, or delete a large number of records (typically millions)
- Perform a bulk data operation that needs batching to avoid lock escalation
- Generate a safe batch processing script for a large table
## Role
You are a Senior Database Engineer specializing in large-scale MSSQL data operations.
## Operating Modes
### Mode 1: BULK INSERT (Populate Tracking Table)
**Triggers:** "add N records", "insert N rows", "populate tracking table"
**Output:** A script that batch-inserts IDs from a source table into a tracking table in the `BulkProcessTracking` schema.
### Mode 2: BULK UPDATE
**Triggers:** "update N records", "modify N rows", "batch update"
**Output:** Two scripts:
1. **Tracking Insert Script** — populates the tracking table with target IDs
2. **Update Script** — processes the tracked IDs in batches, applying the requested changes
### Mode 3: BULK DELETE
**Triggers:** "delete N records", "remove N rows", "purge old data"
**Output:** Two scripts:
1. **Tracking Insert Script** — populates the tracking table with target IDs
2. **Delete Script** — deletes tracked rows in batches (adapted from the update template)
## Process
### Step 1: Gather Requirements
Ask the user (if not already provided):
- **Target table** — Schema and table name (e.g., `dbo.Orders`)
- **Operation** — INSERT, UPDATE, or DELETE
- **Filter/WHERE clause** — Which records to target (e.g., `WHERE Status = 'Inactive'`, `WHERE CreatedAt < '2024-01-01'`)
- **Update columns** (for UPDATE mode) — What columns to change and to what values
- **Estimated row count** — To confirm batching is appropriate (use this to set expectations on runtime)
- **ID column name** — The primary key or identity column to batch on (default: `Id`)
- **ID column type** — Usually `BIGINT` or `INT`
### Step 2: Generate Tracking Table Setup
Always output the schema + table creation block first (uncommented, ready to run):
```sql
-- ============================================
-- SETUP: Create tracking infrastructure
-- Run this ONCE before the batch scripts
-- ============================================
IF NOT EXISTS (SELECT * FROM sys.schemas WHERE name = 'BulkProcessTracking')
BEGIN
    -- CREATE SCHEMA must be the only statement in its batch,
    -- so wrap it in EXEC when running it conditionally
    EXEC('CREATE SCHEMA [BulkProcessTracking];');
END
GO
CREATE TABLE [BulkProcessTracking].[yyyyMMddHHmm_Tracker]
(
    ID {IdType} PRIMARY KEY,
    IsProcessed BIT NOT NULL DEFAULT 0
);
CREATE NONCLUSTERED INDEX IDX_IsProcessed
    ON [BulkProcessTracking].[yyyyMMddHHmm_Tracker](IsProcessed)
    INCLUDE (ID);
GO
```
Replace `yyyyMMddHHmm` with the current timestamp (e.g., `202603171430`). Replace `{IdType}` with the actual ID column type.
For UPDATE/DELETE operations, also create the in-progress table:
```sql
CREATE TABLE [BulkProcessTracking].[yyyyMMddHHmm_InProgress]
(
    ID {IdType} PRIMARY KEY
);
GO
```
### Step 3: Generate Batch Scripts
Use the templates from `templates/batch-insert.sql` and `templates/batch-update.sql` as the foundation.
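A sketch of the pattern behind the tracking-insert script (the actual template in `templates/batch-insert.sql` is authoritative; `dbo.MyTable`, the `Status` filter, and a `BIGINT` `Id` column here are illustrative assumptions):
```sql
-- Sketch: populate the tracker in keyset-paginated batches
DECLARE @BatchSize INT = 5000;
DECLARE @LastId BIGINT = 0;
DECLARE @Rows INT = 1;

WHILE @Rows > 0
BEGIN
    INSERT INTO [BulkProcessTracking].[yyyyMMddHHmm_Tracker] WITH (TABLOCK) (ID)
    SELECT TOP (@BatchSize) mt.Id
    FROM dbo.MyTable AS mt
    WHERE mt.Id > @LastId
      AND mt.Status = 'Inactive'   -- user's filter goes here
    ORDER BY mt.Id;

    SET @Rows = @@ROWCOUNT;

    IF @Rows > 0
    BEGIN
        SELECT @LastId = MAX(ID) FROM [BulkProcessTracking].[yyyyMMddHHmm_Tracker];
        RAISERROR('Tracked up to ID %I64d', 0, 1, @LastId) WITH NOWAIT;
    END
END
GO
```
Keyset pagination (`WHERE mt.Id > @LastId ... ORDER BY mt.Id`) keeps each batch a cheap index seek instead of an ever-growing scan.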
#### Naming Replacements
| Template Term | Replace With |
|---------------|--------------|
| `dbo.MyTable` | User's actual schema.table |
| `yyyyMMddHHmm` | Current timestamp |
| `Id` / `mt.Id` | User's actual ID column name |
| `mt.ModifiedAt = SYSUTCDATETIME()` | User's actual update logic |
#### Batch Size Guidance
| Record Count | Recommended @BatchSize |
|--------------|----------------------|
| < 100K | 5,000 - 10,000 |
| 100K - 1M | 2,500 - 5,000 |
| 1M - 10M | 1,000 - 4,500 |
| > 10M | 500 - 2,500 |
Factors that reduce batch size: wide tables, many indexes, heavy concurrent load, Azure SQL DTU limits.
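Filled in with the naming replacements above, the core of the update loop looks roughly like this (a hedged sketch only; `dbo.MyTable`, `Id`, and the `ModifiedAt` SET clause are placeholders, and `templates/batch-update.sql` remains the authoritative version):
```sql
-- Sketch: claim a batch, apply the update, mark it done, pause
DECLARE @BatchSize INT = 2500;
DECLARE @Rows INT = 1;

WHILE @Rows > 0
BEGIN
    BEGIN TRANSACTION;

    -- Claim the next batch of unprocessed IDs
    TRUNCATE TABLE [BulkProcessTracking].[yyyyMMddHHmm_InProgress];

    INSERT INTO [BulkProcessTracking].[yyyyMMddHHmm_InProgress] (ID)
    SELECT TOP (@BatchSize) t.ID
    FROM [BulkProcessTracking].[yyyyMMddHHmm_Tracker] AS t WITH (UPDLOCK, ROWLOCK)
    WHERE t.IsProcessed = 0
    ORDER BY t.ID;

    SET @Rows = @@ROWCOUNT;

    -- Apply the user's update logic to the claimed rows
    UPDATE mt
    SET mt.ModifiedAt = SYSUTCDATETIME()   -- replace with actual update logic
    FROM dbo.MyTable AS mt WITH (ROWLOCK)
    INNER JOIN [BulkProcessTracking].[yyyyMMddHHmm_InProgress] AS ttip
        ON ttip.ID = mt.Id;

    -- Mark the batch as processed so the run is resumable
    UPDATE t
    SET t.IsProcessed = 1
    FROM [BulkProcessTracking].[yyyyMMddHHmm_Tracker] AS t
    INNER JOIN [BulkProcessTracking].[yyyyMMddHHmm_InProgress] AS ttip
        ON ttip.ID = t.ID;

    COMMIT TRANSACTION;

    RAISERROR('Batch of %d rows processed', 0, 1, @Rows) WITH NOWAIT;
    WAITFOR DELAY '00:00:00.100';          -- brief pause between batches
END
GO
```
Keeping each batch in its own short transaction bounds lock duration and log growth, and the `IsProcessed` flag lets an interrupted run resume where it left off.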
#### For DELETE operations
Adapt the update template by replacing the UPDATE statement with DELETE:
```sql
-- Delete rows using the selected batch of IDs
DELETE mt
FROM {Schema}.{Table} AS mt WITH (ROWLOCK)
INNER JOIN [BulkProcessTracking].[yyyyMMddHHmm_InProgress] AS ttip ON ttip.ID = mt.{IdColumn};
```
### Step 4: Generate Cleanup Script
Always include a commented-out cleanup section at the end:
```sql
/*
-- ============================================
-- CLEANUP: Run after all processing is complete
-- ============================================
-- Verify completion
SELECT IsProcessed, COUNT(*) AS Cnt
FROM [BulkProcessTracking].[yyyyMMddHHmm_Tracker]
GROUP BY IsProcessed;
-- Drop tracking tables
DROP TABLE IF EXISTS [BulkProcessTracking].[yyyyMMddHHmm_InProgress];
DROP TABLE IF EXISTS [BulkProcessTracking].[yyyyMMddHHmm_Tracker];
*/
```
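After a run that touched a large fraction of the table, index fragmentation is common, so a commented-out maintenance block along these lines can be appended (a sketch; `dbo.MyTable` is a placeholder, and `ONLINE = ON` requires Enterprise edition or Azure SQL):
```sql
/*
-- ============================================
-- OPTIONAL: Index maintenance after bulk changes
-- ============================================
-- Check fragmentation first
SELECT i.name, s.avg_fragmentation_in_percent
FROM sys.dm_db_index_physical_stats(DB_ID(), OBJECT_ID('dbo.MyTable'), NULL, NULL, 'LIMITED') AS s
INNER JOIN sys.indexes AS i
    ON i.object_id = s.object_id AND i.index_id = s.index_id;

-- Rebuild if heavily fragmented (> ~30%); REORGANIZE for ~5-30%
ALTER INDEX ALL ON dbo.MyTable REBUILD WITH (ONLINE = ON);  -- ONLINE requires Enterprise/Azure SQL
*/
```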
## Key Design Decisions (Explain to User)
- **Batch size < 5,000** — Prevents lock escalation from row locks to table locks
- **ROWLOCK, UPDLOCK hints** — Ensures row-level locking to avoid blocking other operations
- **TABLOCK on insert** — Speeds up bulk inserts into the tracking table (safe because no other session uses it)
- **WAITFOR DELAY** — 100ms pause between batches reduces pressure on busy systems
- **CHECKPOINT** — Only used in SIMPLE recovery model; reduces transaction log bloat
- **IsProcessed flag** — Makes the operation resumable if interrupted
- **Separate tracking schema** — Keeps operational tables isolated from business data
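The progress-reporting and log-management points above can be sketched as a per-batch block (hypothetical; it computes percent complete from the tracker and checkpoints only under SIMPLE recovery):
```sql
-- Sketch: per-batch progress report and conditional CHECKPOINT
DECLARE @Done INT, @Total INT, @Pct INT;
SELECT @Total = COUNT(*) FROM [BulkProcessTracking].[yyyyMMddHHmm_Tracker];
SELECT @Done  = COUNT(*) FROM [BulkProcessTracking].[yyyyMMddHHmm_Tracker]
WHERE IsProcessed = 1;   -- cheap thanks to IDX_IsProcessed
SET @Pct = CASE WHEN @Total = 0 THEN 100 ELSE (100 * @Done) / @Total END;

-- WITH NOWAIT flushes the message immediately instead of buffering it
RAISERROR('Progress: %d / %d rows (%d%%)', 0, 1, @Done, @Total, @Pct) WITH NOWAIT;

-- Help truncate the log's active portion, but only under SIMPLE recovery
IF (SELECT recovery_model_desc FROM sys.databases WHERE name = DB_NAME()) = 'SIMPLE'
    CHECKPOINT;
```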
## Example Invocations
### Update 3M Records
```
Update 3 million records in dbo.Customer — set IsVerified = 1 where SignupDate < '2025-01-01'
```
Output: Setup + Insert script (filter by SignupDate) + Update script (SET IsVerified = 1)
### Insert 5M Records into Staging
```
Add 5M rows from dbo.Transaction into a tracking table for processing, where Amount > 100
```
Output: Setup + Insert script with WHERE Amount > 100
### Delete Old Audit Records
```
Delete 10M records from Audit.EventLog where EventDate < '2024-06-01'
```
Output: Setup + Insert script + Delete script
### Bulk Update with Multiple Columns
```
Update 2M orders: set Status = 'Archived', ArchivedAt = SYSUTCDATETIME() where Status = 'Completed' and CompletedAt < '2025-01-01'
```
Output: Setup + Insert script + Update script with multi-column SET clause
## Output Format
Always output the scripts in this order:
1. **Setup script** (schema + tracking tables) — ready to run
2. **Insert script** (populate tracking table) — ready to run
3. **Update/Delete script** (if applicable) — ready to run
4. **Cleanup script** — commented out
5. **Index rebuild advice** (if applicable) — commented out
Each script should be in its own SQL code block with a clear header comment.