name: transcript-to-markdown description: Convert a Korean transcript (SRT/VTT/plain text) into a detailed Markdown document with a 3-line summary and chaptered notes with timestamps. Use when a user provides transcript text and wants a structured Markdown writeup with minimal content loss.

transcript-to-markdown

Overview

Use Codex CLI to turn transcripts into a detailed, structured Markdown document with minimal information loss. Output includes a 3-line summary, chaptered notes with timestamps, and quoted key statements.

Prerequisites

This skill works best with transcripts from yt-subs-whisper-translate. If you have a YouTube URL, generate the transcript first:

python3 yt-subs-whisper-translate/scripts/yt_subs_whisper_translate.py "<URL>"

Quick start

Basic usage:

python3 scripts/transcript_to_markdown.py --input transcript.srt --output notes.md --title "Video Title"

With description and chapters:

python3 scripts/transcript_to_markdown.py \
  --input transcript.srt \
  --output notes.md \
  --title "Video Title" \
  --description "Video description for context" \
  --chapters chapters.json

Workflow

Ingest transcript (SRT/VTT/plain text).
Smart chunking:
- If chapters file provided → chunk by chapters
- If chapter > 20 min → sub-chunk with 3 min overlap
- If no chapters and > 20 min → time-based chunking (10 min chunks, 3 min overlap)
Process each chunk with Codex CLI.
Merge chunks with full context (title + description).
Claude final review and approval.

Chunking logic

Condition	Strategy
Has chapters, each < 20 min	Use chapter boundaries
Has chapters, some > 20 min	Sub-chunk long chapters with overlap
No chapters, total < 20 min	Process as single chunk
No chapters, total > 20 min	10 min chunks with 3 min overlap

Input formats

SRT/VTT

Standard subtitle format with timestamps.

Plain text

Requires [MM:SS] prefix per line:

[00:15] First point here
[01:30] Second point here

Chapters file

JSON format (yt-dlp compatible):

[
  {"start_time": 0, "end_time": 300, "title": "Introduction"},
  {"start_time": 300, "end_time": 900, "title": "Main Topic"}
]

Or simple text format:

00:00 Introduction
05:00 Main Topic
15:00 Conclusion

Output format

# Video Title

1. First summary point
2. Second summary point
3. Third summary point

## Chapter 1: Topic Name (00:00-05:00)

- [00:15] Detailed note with timestamp
- [00:45] Another important point
  - Supporting detail
  - Additional context

> "[01:30] Important quote from the speaker" — Speaker Name

## Chapter 2: Next Topic (05:00-15:00)
...

Key features

Timestamps on every bullet: [MM:SS] format for easy video reference
Blockquotes for key statements: Important quotes preserved with speaker attribution
Minimal information loss: Detailed notes, not summaries
Same language output: Matches transcript language (default: Korean)

Step 6: Claude final review and approval

After the merge pass completes, Claude must review the final Markdown output.

Review process:

Read the complete Markdown file.
Check for:
- Timestamp accuracy and consistency
- Proper quote formatting for key statements
- Chapter organization and flow
- Missing or duplicated content from chunk overlaps
- Natural language and readability
Make direct edits to fix any issues found.
Report a summary of changes made (if any) to the user.

Expected outputs

notes.md — Final structured Markdown document
chunks/ — Intermediate files (deleted unless --keep-chunks)

transcript-to-markdown

Overview

name: transcript-to-markdown description: Convert a Korean transcript (SRT/VTT/plain text) into a detailed Markdown document with a 3-line summary and chaptered notes with timestamps. Use when a user provides transcript text and wants a structured Markdown writeup with minimal content loss.

transcript-to-markdown

Overview

Prerequisites

Quick start

Workflow

Chunking logic

Input formats

SRT/VTT

Plain text

Chapters file

Output format

Key features

Step 6: Claude final review and approval

Expected outputs

What This Skill Can Do

Ready to use this skill?

Related Skills