Social Media Strategy

What Does "Caption" Mean? Caption Meaning Explained

10 min read
Illustration explaining the caption meaning with a photo, a video, and a figure each paired with its caption text

The caption meaning most people search for is simple: a caption is a short line of text that accompanies an image, video, figure, or article and explains, describes, or adds context to it. The exact role shifts with the setting — a photo caption, a social media caption, and a video caption are not quite the same thing — but the core idea never changes.

This guide explains what “caption” means in plain terms, walks through every major context where you’ll meet the word, untangles the common mix-ups (caption vs description vs subtitle), covers the word’s surprising origin, and finishes with how to write a caption that actually works.

Table of Contents

Caption Meaning: The Short Answer

A caption is supporting text attached to other content — most often an image, video, or post — that names it, describes it, or adds meaning. It is short by nature, usually a line or two, and it always points at something else rather than standing alone.

That single idea unlocks every use of the word. The text under a newspaper photo is a caption. The words you type beneath your Instagram post are a caption. The on-screen text during a muted video is a caption. Same word, same job: explain or enrich the thing it sits beside. Merriam-Webster defines it directly as “the explanatory comment or designation accompanying a pictorial illustration.”

As a verb, to caption something means to add that text — “she captioned the photo with a joke.”

Caption Definition Across Contexts

The word “caption” shows up in several distinct fields. The definition holds steady, but what the caption does changes with each one. Here is what a caption means in the contexts you’re most likely to encounter.

Social media captions

On platforms like Instagram, TikTok, and LinkedIn, a caption is the written text published alongside a photo, video, reel, or carousel. It adds context, voice, and personality — and unlike a simple photo description, it is a creative and strategic element designed to spark comments, saves, and shares. A strong social caption usually opens with a hook and ends with a call to action.

Photo and figure captions

This is the classic, dictionary-first meaning: a short descriptive or explanatory text — usually one or two sentences — placed directly below a photograph, picture, map, graph, or illustration in a book, newspaper, or magazine. In newspapers this line is also called a cutline. In academic and technical writing, figure and table captions add a number (“Figure 3”) and a source, and table captions traditionally sit above the table.

Closed captions and subtitles

In video, captioning is the process of converting audio into time-synced, on-screen text. Closed captions transcribe dialogue plus sound effects, music cues, and speaker labels so that deaf and hard-of-hearing viewers get the full experience. They can be toggled on or off; open captions are burned permanently into the video.

News and broadcast captions

In TV news and headlines, a caption is the short title or heading that introduces or labels a segment, lower-third, or chyron — the on-screen text identifying a speaker or topic.

In law, the caption is the formal heading of a court document showing where, when, and by what authority it was taken — the court name, the parties, and the case number at the top of a filing.

Where Did the Word “Caption” Come From?

“Caption” comes from the Latin captionem, from capere meaning “to take,” and originally meant a “taking” or “seizure” when it entered English in the late 14th century. Its journey to today’s meaning is a slow drift from law to layout.

By the mid-1600s, “caption” described the certificate at the head of a legal document recording a seizure or deposition. Because that text sat at the top, the sense widened to “the beginning of any document,” then to the “heading of a chapter or section” by 1789.

Finally, around 1919 and especially in American English, it landed on its modern meaning: the descriptive text below an illustration, as the Online Etymology Dictionary records. The word for the top of a page became the word for the line under a picture.

Here’s the fun part: because both trace back to capere (“to take, seize”), “caption” and “capture” are etymological cousins — a caption, in a sense, captures what an image is about. The verb sense (“to caption a photo”) followed later, recorded around 1901.

Caption vs Description vs Subtitle

These three words overlap enough to cause confusion, but each has a distinct job. The table below sorts them out.

TermWhat it isPrimary purposeWhere it appears
CaptionShort text tied to an image, video, or postIdentify, describe, or add meaningUnder photos; alongside social posts; on-screen in video
DescriptionLonger explanatory text about contentGive fuller detail or alt-text for accessibilityImage alt attributes; YouTube video descriptions; product pages
SubtitleOn-screen translation of spoken dialogueTranslate audio for viewers who can hear but not understand the languageForeign-language film and video

The sharpest line to remember is captions vs subtitles. Both are on-screen video text, but a caption assumes the viewer cannot hear and transcribes everything — dialogue, sound effects, music, speaker IDs — for accessibility. A subtitle assumes the viewer can hear and only translates the spoken words.

As Rev puts it, captions are about access; subtitles are about translation. Web accessibility guidance from the W3C Web Accessibility Initiative treats captions as a baseline requirement for video, not an optional extra.

Depending on context, a caption synonym might be any of these:

  • Heading or title — for the text introducing a document or section.
  • Legend — common in scientific and cartographic figures.
  • Cutline — the journalism term for the text under a news photo.
  • Subtitle — only when you specifically mean translated video dialogue.
  • Tagline or blurb — loosely, for short accompanying marketing text.

None of these is a perfect one-to-one swap. “Caption” remains the broadest and most flexible of the set, which is why it’s the default word across photography, publishing, video, and social media.

How to Write a Good Caption

Knowing the meaning of caption is one thing; writing one that earns attention is another. The rules differ slightly by context, but a few principles travel everywhere.

  1. Front-load the point. Whether it’s a photo cutline or a social post, put the most important information or the hook in the first few words. On social media, only the first ~125 characters show before the “…more” cutoff.
  2. Be specific, not generic. “Sunset” is a label; “the five minutes before the storm rolled in” is a caption. Specifics give the reader something the image alone can’t.
  3. Match the medium. A figure caption needs accuracy and a number. A social caption needs voice and a reason to engage. A closed caption needs completeness — including sound effects.
  4. Add value the content can’t. A good caption tells you something the picture or video doesn’t already show.
  5. End with intent. For social posts, that means a clear call to action — a question, a “save this,” or a “tag a friend.”

Caption Examples by Context

Seeing the definition in action makes it concrete. Here’s the same idea — a caption supporting other content — across four settings.

  • Photo caption (newspaper): “Mayor Reyes addresses volunteers at the riverside cleanup, Saturday morning.”
  • Figure caption (report): “Figure 2. Monthly active users by platform, Q1–Q4 2025. Source: internal analytics.”
  • Closed caption (video): “[upbeat music] NARRATOR: And that’s how the bridge was built.”
  • Social media caption (Instagram): “Three years, one tiny apartment, zero regrets. What’s the best decision you made this year? 👇”

Notice how the photo and figure captions stay neutral and descriptive, while the social caption adds voice and a question. Same word — but the social version is doing creative work the others aren’t.

Writing Social Media Captions Faster

Once you understand what a caption is, the day-to-day challenge is writing dozens of them without running dry — especially on social media, where every post needs a fresh one. That blank caption box is where most posts stall.

A free Instagram caption generator solves the blank-page problem: describe your photo, pick a vibe, and get ready-to-post options in seconds. For deeper craft and 130+ copy-and-paste examples, our guide to Instagram captions breaks down hooks, length, emojis, and hashtags by category.

If you’d rather skip switching between apps entirely, Outfeed AI is a chat-first social media manager that writes captions in your brand voice and schedules them across nine platforms from a single conversation — no dashboards, no menus. You can read what Outfeed AI is for the full picture, or start with Outfeed AI and turn “what should I caption this?” into a posted update in one chat.

Frequently Asked Questions

What does caption mean?

A caption is a short line of text that accompanies an image, video, figure, or article and explains, describes, or adds context to it. On a photo it sits underneath and identifies what you are seeing; on social media it is the text written alongside a post; on video it is the on-screen transcription of the audio. The common thread across every sense is that a caption is supporting text attached to other content.

What is a caption in simple words?

In simple words, a caption is the short text that goes with a picture, video, or post to tell you what it is or to add meaning. Think of the line printed under a newspaper photo, the words you type under your Instagram post, or the text that appears on screen during a muted video — all of those are captions.

What is the difference between a caption and a subtitle?

Captions and subtitles are both on-screen video text, but they serve different audiences. Captions transcribe everything a deaf or hard-of-hearing viewer needs — dialogue plus sound effects, music, and speaker labels — and assume the viewer cannot hear the audio. Subtitles assume the viewer can hear but does not understand the language, so they translate only the spoken dialogue. Captions are about accessibility; subtitles are about translation.

What is the origin of the word caption?

Caption comes from the Latin captionem (from capere, “to take”), entering English in the late 1300s meaning a “taking” or “seizure.” By the mid-1600s it referred to the heading at the top of a legal document, then to the heading of any document or chapter, and by 1919, especially in American usage, it took on its modern meaning: the descriptive text below an illustration.

What is a caption on social media?

On social media a caption is the written text you publish alongside a photo, video, reel, or carousel. It adds context, voice, and personality, and it often includes a hook, a call to action, and hashtags. Unlike a photo caption that simply describes the image, a social media caption is a creative and strategic element that drives comments, saves, and shares.

captions definitions social media writing