March 27, 2026 · 8 min read

Advanced Prompt Engineering for Code Generation: Techniques That Actually Work

Beyond basic prompting — practical techniques for getting better code from AI. System prompts, few-shot examples, chain-of-thought, structured output, and iterative refinement with real before/after comparisons.

prompt-engineering ai code-generation productivity tools

Most developers use AI code generation the same way: type a vague request, get mediocre code, manually fix it, repeat. The gap between a naive prompt and a well-crafted one is the difference between getting a rough draft and getting production-ready code.

Here's the thing — prompt engineering isn't magic. It's communication. The same skills that make you good at writing tickets and documentation make you good at prompting. You just need to know the specific techniques that work for code generation.

The Problem with Basic Prompts

Here's what most people type:

Write a function that validates email addresses

And here's what they get: a basic regex that misses edge cases, no error handling, no types, no tests. Then they spend 20 minutes fixing it — which defeats the purpose.

The issue isn't that the AI is dumb. It's that you gave it almost zero information about what you actually need. It's like asking a contractor to "build a wall" without specifying dimensions, materials, or location.

Technique 1: System Prompt Engineering

System prompts set the context, role, and constraints before you even ask your question. Most AI coding tools let you configure these — Cursor has .cursorrules, Claude Code has CLAUDE.md, and ChatGPT has custom instructions.

A good system prompt for code generation includes:

You are a senior TypeScript developer working on a Node.js backend.
Follow these constraints:
  • Use TypeScript strict mode (no any types)
  • Prefer functional programming over classes
  • Use zod for runtime validation
  • Handle errors explicitly (no swallowing exceptions)
  • Include JSDoc comments on public functions
  • Use descriptive variable names (no single letters except loop indices)
Output format: code only, no explanations unless asked.

This eliminates entire categories of problems. Instead of getting JavaScript when you wanted TypeScript, or getting classes when your codebase uses functions, the AI starts from the right baseline every time.

In my experience, a well-written system prompt saves more time than any individual prompting technique. Set it once, benefit on every interaction.
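
To make the payoff concrete, here is the kind of output that system prompt steers toward, even for a throwaway request like "write a slugify function" (the function itself is illustrative, not from any particular codebase):

```typescript
/**
 * Converts a title into a URL-safe slug.
 * @param title - The raw title to convert.
 * @returns The lowercased, hyphen-separated slug.
 */
export function slugify(title: string): string {
  return title
    .trim()
    .toLowerCase()
    .replace(/[^a-z0-9\s-]/g, "") // drop punctuation and symbols
    .replace(/[\s-]+/g, "-")      // collapse spaces and hyphen runs
    .replace(/^-|-$/g, "");       // trim leading/trailing hyphens
}
```

Strict types, a JSDoc comment, descriptive names, no classes — all inherited from the system prompt rather than restated in every request.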

Technique 2: Few-Shot Examples

This is the single most underused technique. Instead of describing what you want, show the AI an input/output pair from your codebase.

Without few-shot:
Write a validation function for user registration data
With few-shot:
Here's how we write validators in this codebase:

// Input:
const productSchema = z.object({
  name: z.string().min(1).max(200),
  price: z.number().positive(),
  category: z.enum(["electronics", "clothing", "food"]),
});

export function validateProduct(data: unknown): Result<Product, ValidationError> {
  const parsed = productSchema.safeParse(data);
  if (!parsed.success) {
    return { ok: false, error: formatZodError(parsed.error) };
  }
  return { ok: true, value: parsed.data };
}

// Now write the same pattern for user registration with fields:
// email, password (min 8 chars), name, and optional phone number

The AI now knows your validation library (zod), your error handling pattern (Result type), your function naming convention, and your code organization. The output will match your codebase instead of being generic.

Technique 3: Chain-of-Thought

For complex problems, asking the AI to think before coding produces dramatically better results.

Without chain-of-thought:
Build a rate limiter middleware for Express
With chain-of-thought:
I need a rate limiter middleware for Express. Before writing code,
think step by step:

  1. What algorithm to use (fixed window, sliding window, token bucket)?
  2. What storage backend (in-memory for single server, Redis for distributed)?
  3. What should the API look like for the middleware consumer?
  4. What HTTP headers should we set (X-RateLimit-Limit, etc.)?
  5. How should we handle rate-limited requests?
Then implement the solution.

This forces the AI to consider architecture before diving into code. The result handles edge cases the naive approach misses — like what happens when the server restarts, or how to configure different limits per route.
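
Worked through, those questions might resolve to "fixed window, in-memory" for a single server, yielding something like this sketch. Express's Request/Response types are stubbed with minimal interfaces so the snippet is self-contained; a real version would import them from express:

```typescript
// Minimal stand-ins for Express's types, just enough for the sketch.
interface Req { ip: string }
interface Res {
  setHeader(name: string, value: string | number): void;
  status(code: number): Res;
  json(body: unknown): void;
}
type Next = () => void;

export function rateLimit(limit: number, windowMs: number) {
  // Fixed-window counters keyed by client IP. In-memory, so this only
  // works on a single server; a distributed deployment would use Redis.
  const hits = new Map<string, { count: number; resetAt: number }>();

  return (req: Req, res: Res, next: Next): void => {
    const now = Date.now();
    const entry = hits.get(req.ip);
    if (!entry || now >= entry.resetAt) {
      hits.set(req.ip, { count: 1, resetAt: now + windowMs });
      res.setHeader("X-RateLimit-Remaining", limit - 1);
      return next();
    }
    if (entry.count >= limit) {
      res.status(429).json({ error: "Too many requests" });
      return;
    }
    entry.count += 1;
    res.setHeader("X-RateLimit-Remaining", limit - entry.count);
    next();
  };
}
```

Note that the chain-of-thought questions surface directly in the code: the algorithm choice, the storage choice, and the rate-limit header are all explicit decisions rather than accidents.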

Technique 4: Structured Output

When you need code in a specific format, be explicit about the structure:

Generate a REST endpoint for creating a user.

Return the response in this exact structure:

  1. Zod schema for request validation
  2. Service function (pure business logic, no HTTP concerns)
  3. Controller function (HTTP layer, calls the service)
  4. Route registration
  5. Integration test

Each section separated by a comment header like:
// === SCHEMA ===
// === SERVICE ===
// === CONTROLLER ===
// === ROUTE ===
// === TEST ===

This is particularly useful for generating boilerplate that follows your project's architecture. Instead of getting a monolithic function with business logic mixed into the route handler, you get properly separated concerns.
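
The answer to a prompt like this might come back laid out as the following skeleton (bodies trimmed, and an in-memory array standing in for a real database; the route and test sections are left as comments here because they depend on your framework):

```typescript
// === SCHEMA ===
interface CreateUserInput { email: string; name: string }
function parseCreateUser(data: unknown): CreateUserInput | null {
  if (typeof data !== "object" || data === null) return null;
  const d = data as Record<string, unknown>;
  if (typeof d.email !== "string" || typeof d.name !== "string") return null;
  return { email: d.email, name: d.name };
}

// === SERVICE ===
interface User extends CreateUserInput { id: number }
const users: User[] = []; // stand-in for a database table
function createUser(input: CreateUserInput): User {
  const user = { id: users.length + 1, ...input };
  users.push(user);
  return user;
}

// === CONTROLLER ===
function createUserController(body: unknown): { status: number; body: unknown } {
  const input = parseCreateUser(body);
  if (!input) return { status: 400, body: { error: "invalid input" } };
  return { status: 201, body: createUser(input) };
}

// === ROUTE ===
// app.post("/users", (req, res) => { /* delegate to createUserController */ });

// === TEST ===
// expect(createUserController({ email: "a@b.com", name: "Ana" }).status).toBe(201);
```

The comment headers make the separation of concerns visible at a glance: the controller knows about HTTP status codes, the service does not.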

Technique 5: Iterative Refinement

Don't try to get perfect code in one shot. Build up in passes:

// Pass 1: Core logic
Write a function that converts Markdown to HTML. Support headings,
paragraphs, bold, italic, and links.

// Pass 2: Error handling
Now add error handling. What happens with malformed Markdown?
Unclosed tags? Nested formatting?

// Pass 3: Types
Add TypeScript types. Define an options parameter for configuring
which elements to support.

// Pass 4: Performance
This will process documents up to 50MB. Optimize for large inputs.
Use streaming if appropriate.

// Pass 5: Tests
Write tests covering: basic formatting, edge cases, malformed input,
and performance with large documents.

Each pass builds on the previous output. The AI maintains context from earlier passes, so it's refactoring its own code rather than starting fresh. This produces better results than cramming every requirement into one enormous prompt.
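
For reference, Pass 1 alone might come back as something like this minimal converter — the regexes are deliberately naive, which is exactly what Passes 2 through 4 exist to fix:

```typescript
// Pass 1 output: headings, paragraphs, bold, italic, and links only.
// No error handling, no options, no streaming yet.
export function markdownToHtml(md: string): string {
  return md
    .split(/\n{2,}/) // blank lines separate blocks
    .map((block) => {
      const heading = block.match(/^(#{1,6})\s+(.*)$/);
      if (heading) {
        const level = heading[1].length;
        return `<h${level}>${inline(heading[2])}</h${level}>`;
      }
      return `<p>${inline(block)}</p>`;
    })
    .join("\n");
}

function inline(text: string): string {
  return text
    .replace(/\*\*(.+?)\*\*/g, "<strong>$1</strong>")
    .replace(/\*(.+?)\*/g, "<em>$1</em>")
    .replace(/\[(.+?)\]\((.+?)\)/g, '<a href="$2">$1</a>');
}
```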

Technique 6: Context Management

The biggest factor in code generation quality is the context you provide. Here's what to include:

Always include:
  • The file you're modifying (or similar files for new code)
  • Type definitions and interfaces the code needs to implement
  • Import paths and available utilities
Include when relevant:
  • Database schema or API contracts
  • Error handling patterns from your codebase
  • Test examples showing expected behavior
Summarize, don't dump:
// Bad: pasting 2000 lines of code
// Good:
Our User model has these fields: id (UUID), email (unique),
passwordHash, createdAt, updatedAt.
The UserService class has methods: create(), findById(),
findByEmail(), update(), delete().
We use the Result<T, E> pattern for error handling (never throw).

For large codebases, a summary file (like CLAUDE.md or .cursorrules) that describes architecture, conventions, and patterns lets the AI understand your project without reading every file.

Technique 7: Test-Driven Prompting

This is my favorite technique and it's surprisingly effective:

Write failing tests first for a password strength checker:
  • Minimum 8 characters
  • At least one uppercase, one lowercase, one number
  • At least one special character
  • No more than 3 consecutive identical characters
  • Not in a common passwords list
Use vitest. Write the tests, then write the implementation that makes them pass.

The tests act as an unambiguous specification. The AI can verify its own implementation against concrete expectations rather than interpreting vague requirements.
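
A sketch of the implementation those tests would pin down — the common-passwords list here is a three-entry stand-in for a real one, and plain functions replace the vitest scaffolding so the snippet stands alone:

```typescript
// Tiny stand-in for a real common-passwords list.
const COMMON_PASSWORDS = new Set(["password", "12345678", "qwerty123"]);

/** Returns an empty array for a strong password, or a list of failures. */
export function checkPassword(pw: string): string[] {
  const failures: string[] = [];
  if (pw.length < 8) failures.push("min 8 characters");
  if (!/[A-Z]/.test(pw)) failures.push("needs an uppercase letter");
  if (!/[a-z]/.test(pw)) failures.push("needs a lowercase letter");
  if (!/\d/.test(pw)) failures.push("needs a number");
  if (!/[^A-Za-z0-9]/.test(pw)) failures.push("needs a special character");
  if (/(.)\1{3}/.test(pw)) failures.push("more than 3 consecutive identical characters");
  if (COMMON_PASSWORDS.has(pw.toLowerCase())) failures.push("too common");
  return failures;
}
```

Each requirement in the prompt maps to exactly one check and one test, which is what makes the spec unambiguous.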

Anti-Patterns to Avoid

The kitchen sink prompt: "Write a user authentication system with login, registration, password reset, email verification, 2FA, OAuth, session management, and remember me functionality." Too many requirements at once. Break it into multiple prompts.

The vague prompt: "Make this code better." Better how? Faster? More readable? More type-safe? Be specific about what "better" means.

The no-context prompt: Asking for code without showing what exists. The AI has no idea about your tech stack, coding conventions, or existing utilities.

The copy-paste-and-complain cycle: Getting code, pasting it in, seeing it fail, then prompting "this doesn't work." Instead, include the error message and relevant context: "This throws TypeError: Cannot read property 'id' of undefined on line 23. The user object comes from the auth middleware and might be null when the token is expired."

Tools That Help

Most AI coding tools now support persistent context:

  • CLAUDE.md — Claude Code reads this file for project context, conventions, and constraints
  • .cursorrules — Cursor's equivalent for persistent system prompts
  • Codebase indexing — Cursor and GitHub Copilot index your project for better context
  • File references — tagging specific files in your prompt (@file in Cursor, direct paths in Claude Code)
The common thread: give the AI the same context you'd give a new team member. Architecture docs, coding conventions, example code, and the specific problem you're solving.

The Honest Take

Advanced prompt engineering won't turn a junior developer into a senior one. You still need to understand the code the AI generates, spot bugs, and make architectural decisions. What it does is eliminate the tedious parts — boilerplate, standard patterns, initial implementations — so you can focus on the interesting problems.

The developers I've seen get the most from AI code generation aren't prompt engineers. They're experienced developers who happen to communicate clearly about what they need. The "advanced" techniques above are really just structured communication.

If you want to build the programming fundamentals that make AI tools actually useful, CodeUp focuses on the concepts and patterns that matter — because understanding code is still more important than generating it.
