An AI Reddit comment generator built for indie founders, not for SaaS dashboards

Almost every tool currently sold as an "AI Reddit comment generator" works the same way: you paste a Reddit URL, the model reads the thread, you get a comment back. That shape is fine for an agency cranking out client comments. It is the wrong shape for an indie founder who runs their own account, ships their own product, and cannot afford a single obviously-fake first-person anecdote to land on a 6-week-old account.

Matthew Diakonov · 10 min read
Direct answer (verified 2026-05-10)

For an indie founder running their own Reddit account, the honest answer is to use an open-source generator the founder can read, run on their own machine, and bill against their own LLM credits. Hosted URL-in / comment-out tools (Junia, Planable, Postiz, MediaFast, ReplyPilot) generate one comment from one URL with no memory of what you posted yesterday and no constraint against the model inventing personal specifics. Both of those gaps are how founder accounts get flagged.

The pattern this page argues for, with a source you can verify: github.com/m13v/social-autoposter. Two load-bearing files: scripts/engagement_styles.py (the seven hardcoded archetypes and the grounding rule) and scripts/engage_reddit.py (the per-comment orchestrator that builds the prompt and posts via CDP).

The two things every paste-a-URL generator gets wrong

Imagine the typical hosted Reddit comment generator. You paste a thread URL into a web form. The backend fetches the post, calls a language model with a prompt that looks roughly like "here is a Reddit thread, write a 2 to 4 sentence reply that matches the subreddit tone", and renders the result. There is a free tier (one or two comments per session). There is a $15 to $30 monthly plan. Some of them post for you; most have you copy-paste the comment back into Reddit yourself.

The shape works for an agency. The agency runs hundreds of accounts across hundreds of clients. Voice flattening on any single account is not a problem because nobody is watching any single account closely. Hallucinated specifics are not a problem because the agency does not have a personal brand on the line. For an indie founder running one account that maps to one product and one face, both failure modes are existential.

Voice flattening first. A language model with no memory of your last few replies will default to its strongest rhythm: a soft hedged opener, a 2 to 3 sentence middle, a gentle question at the end. That rhythm is exactly what a moderator's pattern-match picks up. Within a week of running a URL-in tool on the same account, the comments start sounding like each other. A regular in your subreddit DMs you to ask if a bot took over. The moderator does not even need to read the comments; the cadence is enough.

Hallucinated specifics second. With only a URL as context, the model has no grounding for any first-person claim. So when the thread asks "has anyone tried this in production?", the model invents a production. It writes "I ran 22 cameras across three properties for 8 months". The founder pastes it. Two weeks later someone in the thread replies "wait, what three properties?" and the founder has nothing. On a 6-week-old account, that is the report-and-shadowban window opening.

Where the two fixes live in the code

The open-source repo fixes both problems with two small, specific mechanisms. Neither is impressive on its own; the point is that they exist at all. The diagram below traces what happens to a single pending reply between the cron firing and the comment landing on Reddit.

One Reddit reply, end to end

Participants: Founder → engage_reddit.py → engagement_styles → Opus 4.7 → config.json

1. cron tick delivers one pending reply to engage_reddit.py
2. engagement_styles returns the last 3 archetypes used; the prompt is told to rotate away from those
3. config.json supplies the matched project's content_angle / voice, the specifics Lane 2 is allowed to reuse
4. prompt = parent + archetypes + grounding rule + config, sent to Opus 4.7
5. Opus 4.7 returns {action:'reply', text, engagement_style}
6. engage_reddit.py posts via CDP, verifies the DOM, logs the archetype

Two steps deserve a closer look. The call from engage_reddit.py to engagement_styles is where the last three archetypes get pulled and the prompt is told to vary away from them. The call to config.json is where the founder's own content_angle, voice, and messaging blocks get pasted verbatim into the prompt so the grounding rule has something concrete to anchor against. A URL-in tool has neither half of that loop.
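
To make the prompt-assembly step concrete, here is a minimal sketch of how those inputs might combine. Illustrative only: the function name, parameter names, and wording are assumptions, and the real assembly in scripts/engage_reddit.py differs.

# Illustrative only: a pure-function sketch of the "prompt =" step in the
# flow above. The real assembly in scripts/engage_reddit.py differs.
def build_reply_prompt(parent: str, last_styles: list,
                       grounding_rule: str, project: dict) -> str:
    return "\n\n".join([
        grounding_rule,                   # Fix #2: pasted at the top, outranks all style guidance
        f"Parent comment:\n{parent}",
        "Your last 3 engagement styles were: "
        + ", ".join(last_styles)
        + ". Vary away from them.",       # Fix #1: the rotation
        "Founder context (the only specifics Lane 2 may reuse, verbatim):\n"
        f"content_angle: {project['content_angle']}\n"
        f"voice: {project['voice']}\n"
        f"messaging: {project['messaging']}",
        'Return exactly one JSON object: {"action":"skip","reason":"..."} or '
        '{"action":"reply","text":"...","project":null,'
        '"engagement_style":"...","new_style":null}.',
    ])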

Fix #1: seven archetypes that rotate against your last three replies

The first fix is the engagement archetypes. They live in the STYLES dict at the top of scripts/engagement_styles.py. Seven hardcoded entries:

  • critic — point out what is missing, flawed, or naive. Reframe the problem. Authority through a non-obvious insight, not credentials.
  • storyteller — narrative-driven, mandatory failure or surprise lede. Subject to the grounding rule (see below).
  • pattern_recognizer — name the pattern. "I have seen this play out dozens of times across X."
  • curious_probe — one follow-up question on the most interesting detail, prefixed with "curious because we ran into something similar". One question only. Disabled by platform policy on Reddit because question-only replies underperform.
  • contrarian — take a clear opposing position backed by experience. Empty hot takes get destroyed.
  • data_point_drop — share one specific, believable number. Let the number do the talking.
  • snarky_oneliner — short, sharp, emotionally resonant observation. Validates a shared frustration. Locked off on small or serious subs.

On its own that is a list. The mechanism is what makes it work. Before engage_reddit.py builds the prompt, it queries the replies table for the founder's last three engagement_style tags on Reddit. Those three names go into the prompt with an explicit instruction: vary away from them. The model is also handed a top_performers block computed from the last 14 days of those tags joined against engagement signals, so it can self-tune toward what is converting this week without anyone editing the prompt by hand. A URL-in tool sees one Reddit thread and zero history, so its archetype distribution is whatever the base model defaults to. Within a week of running it on the same account, that distribution flattens.
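
In sketch form, assuming a replies table with platform, engagement_style, created_at, and engagement columns (the actual schema may differ), the two history inputs look roughly like this:

# Sketch of the two history inputs described above. sqlite3 stands in for
# the repo's Neon Postgres; table name, columns, and avg-for-median are
# assumptions, not the actual schema.
import sqlite3

def last_three_styles(conn, platform="reddit"):
    rows = conn.execute(
        "SELECT engagement_style FROM replies WHERE platform = ? "
        "ORDER BY created_at DESC LIMIT 3", (platform,)).fetchall()
    return [r[0] for r in rows]   # the prompt is told: vary away from these

def top_performers(conn, platform="reddit"):
    # 14-day window of archetype tags joined against engagement signal;
    # avg() here stands in for the median the text describes
    return conn.execute(
        "SELECT engagement_style, avg(engagement) AS score FROM replies "
        "WHERE platform = ? AND created_at > datetime('now', '-14 days') "
        "GROUP BY engagement_style ORDER BY score DESC",
        (platform,)).fetchall()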

The archetype list is not fixed: the model can invent new ones

One detail that surprised us when we shipped the rotation: the model started wanting to invent. About once a week, on a thread that did not fit any of the seven hardcoded archetypes, Opus 4.7 would return an engagement_style name we had never seen, with a companion JSON block called "new_style" explaining what it was and why the existing seven did not fit. We did not design this. We added a hook for it after the third time it happened.

validate_or_register() in engagement_styles.py picks those up. The new name lands in scripts/engagement_styles_extra.json with status "candidate". Candidate styles appear in subsequent prompts but the picker only allocates them STYLE_FLOOR_PCT (5 percent) of the distribution so a single weird invention cannot dominate. A nightly job in scripts/promote_engagement_styles.py graduates a candidate to "active" once enough samples land in the posts table with median engagement at or above platform baseline.
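
A minimal sketch of that candidate path, with the sidecar format and key names inferred from the description rather than copied from the repo:

# Sketch of the candidate path. Sidecar format and key names are
# inferred from the description above, not copied from the repo.
import json
from pathlib import Path

SEVEN = {"critic", "storyteller", "pattern_recognizer", "curious_probe",
         "contrarian", "data_point_drop", "snarky_oneliner"}
SIDECAR = Path("scripts/engagement_styles_extra.json")
STYLE_FLOOR_PCT = 0.05   # a candidate never gets more than 5% of the picks

def validate_or_register(name, new_style):
    """Return the style's status, registering unseen names as candidates."""
    if name in SEVEN:
        return "active"
    extra = json.loads(SIDECAR.read_text()) if SIDECAR.exists() else {}
    if name not in extra and new_style is not None:
        extra[name] = dict(new_style, status="candidate")
        SIDECAR.write_text(json.dumps(extra, indent=2))
    return extra.get(name, {}).get("status", "unknown")

The nightly promoter is the other half of the loop: it flips status to "active" once sample count and median engagement clear the bar described above.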

As of today the sidecar has three promoted entries running in active rotation:

  • agree_and_extend — validate the original framing, then add a non-obvious extension. Sits between pattern_recognizer and storyteller. Promoted 2026-04-29 after 5 Reddit posts at median 1.0 upvotes vs platform baseline 1.0.
  • share_experience — lived experience without storyteller's mandatory failure-lead. Promoted 2026-04-30. Still subject to the grounding rule: specifics must come from config, no fabricated headcount or durations.
  • technical_detail — a concrete implementation detail or version-specific behavior. Promoted after 4 Reddit and 4 Twitter posts with median engagement at platform baseline.

A founder running a hosted URL-in tool has no equivalent surface for this. Their archetype taxonomy is whatever the vendor decided at v1, frozen until the vendor pushes a release.

Fix #2: the two-lane grounding rule

The second fix is the one that matters for staying on the right side of Reddit's moderation, and it is the part no hosted generator can replicate without seeing your config.json. The rule is defined in the get_grounding_rule() function in engagement_styles.py (around line 743) and gets pasted at the top of every reply prompt. It outranks every other style instruction. The prompt says so explicitly: "GROUNDING RULE (highest priority; overrides any other style guidance)".

How the grounding rule shapes one comment

Step 1. Pick the lane before drafting

Every reply is forced to commit to one lane up front. Niche or expert subreddits push the model toward Lane 2 because the audience punishes any whiff of fabrication. Casual or advice subs tolerate Lane 1 when the disclosure feels natural.

Step 2. Lane 1: disclosed story

Open with a phrase that signals the story is illustration, not lived testimony. Acceptable openers, all listed verbatim in the prompt: 'hypothetically', 'imagine someone running this', 'say a friend tried', 'as a thought experiment', 'scenario:', 'to make this concrete, picture', 'made-up example but'. After the opener, the model has full creative license on numbers, durations, headcount, named programs.

Step 3. Lane 2: no fabrication, first person OK

Stay plain-voiced. Any specific (number, duration, date, place name, course or program name, headcount, named tool, named person) is allowed only when it appears verbatim in the matched project's content_angle, voice (tone, examples_good), or messaging (lead_with_pain, solution, proof) in config.json. If a specific is not in config: drop it, generalize ('a few months', 'a handful of cameras'), or pattern-frame ('the part that breaks down is...', 'the typical failure mode is...'). Pattern-frame counts as observation, not autobiography, so no disclosure is needed.

Step 4. Never mix the lanes

An invented specific presented as a personal first-hand claim with no disclosure is the exact failure mode the rule exists to kill. The prompt names this directly with a bad example ('i ran 22 cameras for 8 months and we were getting 400+ person detected pings a night') and gives both rewrites (Lane 1 with a 'scenario:' opener, Lane 2 with pattern-framing and no invented numbers).

Step 5. Tag the reply, write the row, close the loop

The model emits a single JSON object with the archetype it picked. The orchestrator parses it, posts via CDP, re-reads the DOM to verify the comment landed, then writes the row to the replies table with the archetype tag. Two weeks later that tag feeds back into the rotation as a top_performers report so the prompt can self-tune toward what converted, without a human in the loop.

The rule reads as elaborate but it collapses to one sentence: every first-person specific in a generated comment must either be disclosed as illustration or be a verbatim copy from the founder's own product config. Nothing in between. The prompt calls out the exact failure mode in capital letters: NEVER MIX. Do not write "i ran 22 cameras for 8 months" without either a Lane 1 disclosure in front of it, or those numbers being in config.json. That is the failure mode this rule exists to kill.
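
In sketch form, the block the prompt receives might read like the reconstruction below. The real wording lives in get_grounding_rule() in scripts/engagement_styles.py; treat this as a paraphrase of the rules above, not a copy.

# Paraphrased sketch of the grounding-rule prompt block. Reconstructed
# from the rules described on this page, not copied from the repo.
LANE_1_OPENERS = [
    "hypothetically", "imagine someone running this", "say a friend tried",
    "as a thought experiment", "scenario:", "to make this concrete, picture",
    "made-up example but",
]

def get_grounding_rule():
    return (
        "GROUNDING RULE (highest priority; overrides any other style guidance)\n"
        "Pick ONE lane before drafting.\n"
        "Lane 1, DISCLOSED STORY: open with one of "
        + ", ".join(repr(o) for o in LANE_1_OPENERS)
        + "; after the opener, numbers, durations, headcount, and named\n"
        "programs may be invented freely.\n"
        "Lane 2, NO FABRICATION: first person is fine, but every specific\n"
        "(number, duration, date, place, program, headcount, tool, person)\n"
        "must appear verbatim in the founder's config.json (content_angle,\n"
        "voice, messaging). Otherwise drop it, generalize it, or\n"
        "pattern-frame it ('the typical failure mode is...').\n"
        "NEVER MIX: an invented specific presented as first-hand with no\n"
        "disclosure is the failure mode this rule exists to kill."
    )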

Two worked examples ship with the rule itself, drawn from real posts in our database. The bad one (fabricated personal anecdote, no disclosure, no config anchor): "i ran this exact pipeline last semester for two anatomy blocks, cheap recorder into whisper into gpt into anki, raw gpt got me about 35% usable cards...". The Lane 1 rewrite, same details, but disclosed: "hypothetically, imagine running this for a couple of lecture blocks: cheap recorder into whisper into gpt into anki. raw prompts get you somewhere around a third usable cards before duplicate distractors and trivial restatements take over." The Lane 2 rewrite, no invented specifics: "the whisper-to-gpt-to-anki setup isn't where this breaks. card generation is. raw prompts produce roughly a third usable before duplicate distractors and trivial restatements take over."

All three convey the same insight. The first sets up a future contradiction; the second is safe only because of the disclosure; only the third can go out from a founder's real account in plain first-person voice with nothing to defend later.

Side by side with the hosted generators

This is the section where comparison pages usually fudge the numbers. The honest version: hosted tools are easier to start with, they have nicer onboarding, and a founder who values their evening more than their account ban risk should probably use one of them. The table is the cost the indie founder pays for the convenience.

Feature | Hosted URL-in tools | S4L (open-source CLI)
Account location | Founder signs into the vendor's app or browser extension; the vendor sees the session | Persistent Chromium profile on the founder's machine; no vendor account exists
LLM billing | Bundled into a $15 to $30 per month seat | Founder's own Anthropic OAuth or API key; ~$4.73 per Reddit comment on Opus 4.7
Archetype rotation | None; each generation is independent | Last three engagement_styles passed to the prompt; the model must vary
Grounding rule against fabricated specifics | None; the model is free to invent first-person numbers and durations | Two lanes enforced in the prompt: disclosed-fiction OR no-fabrication
Per-founder voice context | Subreddit tone match only | config.json content_angle, voice, messaging passed in; Lane 2 specifics must appear verbatim
New style invention | Static taxonomy | Model can ship a new_style block; a sidecar promoter graduates it after engagement signal
Source you can read before trusting it | Closed source | github.com/m13v/social-autoposter
Posting layer | Vendor's posting infrastructure | Python orchestrator drives CDP, re-reads the DOM to verify, then writes the row

Junia, Planable, Postiz, MediaFast, and ReplyPilot all sit in the 'hosted URL-in tools' column for the dimensions above. ReplyAgent is a separate shape (managed posting accounts at $3 per comment); it solves a different problem (account inventory) and inherits the same archetype-and-grounding gaps.

When a hosted generator is actually the right call

Three cases where the argument above flips. First, an indie founder who is testing whether Reddit is even the right channel for their product. Spending an afternoon wiring up a CLI before you know if any of the subreddits in your space respond to your angle is premature. A free tier on Junia or Planable is the cheaper experiment. Once the answer is "yes, this channel works", the trade-offs invert and the open-source shape starts paying off.

Second, a founder who genuinely cannot run a local CLI. The S4L install needs Node.js 16+, Python 3.9+, Claude Code CLI on PATH, psql, and a Neon Postgres database. That is a bar a developer founder clears in fifteen minutes and a non-technical founder cannot clear at all. The hosted tools exist for the second group.

Third, anyone doing this for clients at scale. The argument here is built on the assumption that one Reddit account maps to one founder and one product. An agency cycling through dozens of client accounts has different incentives. The open-source repo's qualification block names this directly as a disqualifier: "marketing agency looking to resell an autoposter to clients at scale". The tool was not designed for that shape.

What you actually do, concretely, this week

If you have read this far and the shape still fits you, the install is two steps. The first scaffolds the repo into ~/social-autoposter/:

npx social-autoposter init

And then, inside a Claude Code session, "set up social autoposter". The skill walks through verifying the Neon connection, filling in config.json with your handles, running a 5-question interview to draft your content_angle (which is what Lane 2 of the grounding rule will later anchor against), and verifying the Reddit browser login in a persistent Chromium profile.
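
For orientation, here is roughly what a filled config can look like, shown as a Python dict to keep one language across the examples on this page. The field names (handles, content_angle, voice, messaging and their sub-keys) come from this page; the nesting and every value are invented placeholders.

# Illustrative config.json shape, shown as a Python dict. Field names come
# from this page; nesting and all values are invented placeholders.
CONFIG = {
    "handles": {"reddit": "your_reddit_username"},
    "projects": [{
        "content_angle": "self-hosted camera monitoring for small operators",
        "voice": {
            "tone": "plain, technical, no hype",
            "examples_good": ["we run a handful of cameras on one box"],
        },
        "messaging": {
            "lead_with_pain": "cloud NVR bills grow faster than camera count",
            "solution": "local pipeline, no per-camera fee",
            "proof": "only numbers you are willing to defend in a thread",
        },
    }],
}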

The first comment will land somewhere between three and ten minutes after the engage launchd job loads. A row in the replies table will record the archetype the model picked, the session UUID, the token usage, and the post URL. If the comment is bad, you read the row, find the prompt, fix it. The loop closes.

Want help wiring this onto your accounts?

Twenty minutes on a call, we walk through the subreddits you care about, your config.json so Lane 2 grounding has something to anchor on, and the first three archetypes worth running on your account. No deck.

Common questions from indie founders evaluating this

What is wrong with paste-a-URL Reddit comment generators for indie founders?

Two failure modes. First, they have no memory of what you posted yesterday, so the model defaults to its strongest pattern over and over and your voice flattens within a week. Founders who run their own account notice this the moment a regular commenter on their subreddit DMs them asking if a bot took over. Second, with only a URL as input the model has no grounding for first-person specifics, so it invents them. 'I ran 22 cameras for 8 months' lands on a thread where the founder has never seen 22 cameras in their life. On a brand-new account, the moment one of those gets reported, the shadowban window opens.

What is the 'grounding rule' and where does it live in the code?

It is a prompt block defined in scripts/engagement_styles.py at the get_grounding_rule() function (around line 743). Every reply prompt forces the model to pick ONE of two lanes. Lane 1 (DISCLOSED STORY) opens with a phrase like 'hypothetically', 'imagine someone running this', 'scenario:' or 'made-up example but', after which the model can invent any specifics it wants. Lane 2 (NO FABRICATION) keeps first-person voice only when every number, duration, date, place name, course name, headcount, or named tool appears verbatim in the founder's own config.json under content_angle, voice, or messaging. If a specific is not in the config, the model must drop it, generalize it, or pattern-frame it ('the part that breaks down is...'). Mixing the lanes (invented specific presented as a personal first-hand claim with no disclosure) is the exact failure mode the rule exists to kill.

Which seven archetypes does the generator rotate through?

critic, storyteller, pattern_recognizer, curious_probe, contrarian, data_point_drop, snarky_oneliner. They are the keys of the STYLES dict at the top of scripts/engagement_styles.py. Each has a one-line description, an example, a best_in map of subreddits where it tends to score, and a note flagging anti-patterns (no nitpicking for critic, no multi-question stacks for curious_probe, no snark in r/vipassana, and so on). On Reddit the curious_probe archetype is currently disabled by platform policy because question-only replies underperform and read as bait. The orchestrator passes the last three engagement_styles used to the prompt, and the model is instructed to vary away from them.
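
Assembled from that description, one entry might look like the sketch below; the key names and the subreddits in best_in are assumptions for illustration.

# Sketch of one STYLES entry, assembled from the description above.
# Key names and the subreddits in best_in are assumptions.
STYLES = {
    "critic": {
        "description": "point out what is missing, flawed, or naive; reframe",
        "example": "the gap here is not the model, it is the eval set",
        "best_in": {"r/startups": "high", "r/SaaS": "medium"},
        "note": "no nitpicking; authority through a non-obvious insight",
    },
    # ...six more entries: storyteller, pattern_recognizer, curious_probe,
    # contrarian, data_point_drop, snarky_oneliner
}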

Can the model invent new archetypes, and what stops a bad one from taking over?

Yes. When the model returns a JSON object with an engagement_style that is not in the seven hardcoded names, it can ship a 'new_style' block alongside it describing the invention. validate_or_register() in scripts/engagement_styles.py picks that up and writes it to scripts/engagement_styles_extra.json with status 'candidate'. Candidate styles appear in subsequent prompts but the picker only allocates them the STYLE_FLOOR_PCT weight (currently 5 percent) so a single weird invention cannot dominate. A nightly job in scripts/promote_engagement_styles.py promotes a candidate to 'active' only after enough samples land in the posts table with non-bad engagement signals. As of 2026-05-10 the sidecar already has agree_and_extend, share_experience, and technical_detail in active rotation, each promoted after 4 to 6 posts with median engagement at or above platform baseline.

How does S4L compare to Junia, Planable, Postiz, MediaFast, ReplyPilot on price?

Hosted generators bill in two shapes. Junia, Planable, Postiz, and ReplyPilot have a free tier (one or two comments per session, rate limited) plus a paid plan ($15 to $30 a month). MediaFast is subscription only with a founder bundle. ReplyAgent charges $3 per successfully posted comment using managed accounts. S4L is open source and unpriced. You bring your own LLM credits (Anthropic OAuth on the Claude Pro plan is what we use, so the marginal cost lands at about $4.73 per Reddit comment on Opus 4.7 across 4,592 sessions we have logged). The mental model is closer to running a Python script than buying a SaaS seat.

Why does an indie founder care about archetype rotation specifically?

Most founder accounts on Reddit are one or two months old and have under 1,000 karma. The subreddit moderators are watching for two things: cross-subreddit promotional behavior, and stylistic uniformity that reads as automation. Stylistic uniformity is the easier signal to fire on because it does not require a moderator to click through your history. If every comment opens with the same hedge, lands the same length, and ends with the same kind of soft question, a single mod scan flags the pattern. Rotating archetypes is a direct mitigation. It forces variance the model would not produce on its own. The model defaults to the rhythm that worked last week; the rotation prevents that rhythm from accumulating.

Do I need to expose my Reddit credentials to a third party to use S4L?

No. S4L runs on the founder's machine. The Reddit browser session lives in a persistent Chromium profile at ~/.claude/browser-profiles/reddit/ that you log into once via the regular Reddit web UI. The session cookie never leaves that profile directory. The Python orchestrator drives that browser via CDP to post. The LLM call is your own (Anthropic OAuth or API key). There is no S4L cloud, no centrally managed account inventory, and no place a vendor could leak your credentials from because there is no vendor account.

What does the JSON contract the model is forced to return look like?

Each reply prompt forces the model to emit exactly one JSON object, either {"action":"skip","reason":"..."} or {"action":"reply","text":"...","project":null,"engagement_style":"<one of the seven, or an invented name>","new_style":null}. The orchestrator parses that and does everything with side effects: it appends an active campaign suffix if one is matched, posts via CDP, re-reads the DOM to verify the comment text actually rendered, then writes the row to the replies table with the archetype tag and the session UUID. The split exists because letting the model do the actual posting is what gets accounts banned. Deterministic Python doing the post is auditable; an LLM tool call is not.
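
In sketch form, with helper names and validation details assumed:

# Sketch of that split: parsing is the model's only output channel, and
# everything with side effects stays in deterministic Python. Helper names
# and validation details are assumptions.
import json

SEVEN = {"critic", "storyteller", "pattern_recognizer", "curious_probe",
         "contrarian", "data_point_drop", "snarky_oneliner"}

def parse_model_output(raw):
    obj = json.loads(raw)                 # must be exactly one JSON object
    if obj["action"] == "skip":
        return None                       # caller logs obj["reason"], moves on
    if obj["action"] != "reply":
        raise ValueError("unknown action: " + obj["action"])
    if obj["engagement_style"] not in SEVEN and obj.get("new_style"):
        pass   # candidate path: validate_or_register(), as sketched earlier
    return obj
# The caller then appends a campaign suffix if one matched, posts via CDP,
# re-reads the DOM to verify the text rendered, and writes the replies row
# with the archetype tag and session UUID.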
