File: docs_site/docs/guide/basic-concepts.md

Recommend this page to a friend!

docs_site/docs/guide/basic-concepts.md

File:	`docs_site/docs/guide/basic-concepts.md`
Role:	Auxiliary data
Content type:	`text/markdown`
Description:	Auxiliary data
Class:	HyperFlow PHP Framework to develop AI agents
Author:	By Muhammad Umer Farooq
Last change:
Date:	6 days ago
Size:	`7,329 bytes`

Download

Basic concepts

HyperFlow is a self-improving agent framework. Instead of manually tuning an AI agent, you let another AI agent do it automatically.

The core idea comes from evolutionary computation and Quality-Diversity style archives: keep many agent versions, score them, and use strong ancestors as parents for the next mutation (the MetaAgent edits code). Workflow diagrams below match this narrative so you can read in one place.

Overview

flowchart TB
  subgraph evoLoop [Evolutionary loop]
    Sel[Select parent]
    Imp[MetaAgent improve]
    Ev[Evaluate TaskAgent]
    Sel --> Imp --> Ev
    Ev --> Sel
  end
  Arc[(Archive)]
  evoLoop <--> Arc

Workflow diagrams

Evolutionary loop (outer)

flowchart LR
  subgraph archiveBlock [Archive]
    Archive[(archive.jsonl)]
  end
  Select[selectParent]
  Setup[setupExecutor]
  Patches[applyLineagePatches]
  Meta[runMetaAgent]
  Eval[runHarnessAndReport]
  Save[updateArchive]
  Select --> Setup --> Patches --> Meta --> Eval --> Save
  Save --> Archive
  Archive --> Select

One generation (sequence)

Use participant id Main (not Loop) ? Mermaid reserves loop for control blocks.

sequenceDiagram
  participant Main as generateLoop
  participant Arch as archive
  participant Exec as executor
  participant Meta as metaAgent
  participant Task as taskAgent
  participant Dom as domain
  Main->>Arch: loadArchive and selectParent
  Main->>Exec: setup with patch chain
  Main->>Meta: improve from eval feedback
  Meta->>Exec: write diff and files
  Main->>Task: forward per task
  Task->>Dom: prediction
  Main->>Dom: evaluate prediction
  Main->>Arch: save scores and patches

TaskAgent vs MetaAgent (programs)

flowchart TB
  subgraph meta [MetaAgent]
    MIn[repoPath eval results score context]
    MTools[bash plus editor]
    MOut[patch and edited files]
    MIn --> MTools --> MOut
  end
  subgraph task [TaskAgent]
    TIn[task prompt]
    TTools[optional domain tools]
    TOut[prediction string]
    TIn --> TTools --> TOut
  end
  meta -->|mutates code prompts tools| task

Execution mode

flowchart TD
  Q[executionMode]
  Q -->|local| L[LocalExecutor temp dir]
  Q -->|docker| D[DockerExecutor per generation]

The two agents

TaskAgent ? the worker

The TaskAgent solves domain-specific tasks. It receives a formatted prompt, optionally uses tools, and returns a prediction.

Input: A task description (formatted by the domain harness).
Output: A prediction.
Tools: Domain-specific, optional.

MetaAgent ? the improver

The MetaAgent is the mutation operator. HyperFlow treats an agent as a computable program, so the MetaAgent can refine logic, prompts, tools, and strategies on disk (metacognitive self-modification).

Input: Repo path, evaluation results, parent score context.
Output: Patches / modified source files.
Tools: Built-in `bash` and `editor`.

How they cooperate

sequenceDiagram
  participant Meta as MetaAgent
  participant Disk as workspace
  participant Task as TaskAgent
  Meta->>Disk: edit prompts tools domain code
  Task->>Disk: read updated logic
  Task-->>Disk: predictions for harness

The evolutionary loop

The loop (see src/Core/GenerateLoop.php) runs generations until max_generations or early stop. Each generation typically:

Select parent from the archive.
Set up executor (currently LocalExecutor in PHP).
Run MetaAgent ? produce a new modification from failures and context.
Run TaskAgent through the harness.
Evaluate ? domain scores predictions; reports under the output directory.
Update archive ? append a JSONL snapshot with scores and logic changes.

The archive

The archive is an append-only JSONL file: each line is a full snapshot. Read the last line for current state. Lineage is a tree: parent_id points to the real ancestor, not necessarily the latest id.

flowchart TD
  n0[initial]
  n1[gen1 score 0.7]
  n2[gen2 score 0.65]
  n3[gen3 score 0.82]
  n4[gen4 score 0.85]
  n5[gen5 score 0.3]
  n0 --> n1
  n1 --> n2
  n1 --> n3
  n2 --> n4
  n0 --> n5

Why JSONL?

| | JSON | JSONL | | --- | --- | --- | | Structure | One object per file | One object per line | | Append | Rewrite file | Append line | | Latest state | Parse all | Read last line | | Typical use here | report.json, predictions.json | archive.jsonl |

Parent selection strategies

Chosen once in config for the whole run (select_parent.py):

| Strategy | Behavior | | --- | --- | | random | Uniform over valid parents ? max exploration | | latest | Most recent valid parent ? simple chain | | best | Highest score ? pure exploitation | | score_prop | Random weighted by score | | score_child_prop | Score-weighted with child penalty (default) |

Why not always best? You can get stuck in a local maximum. Child penalty uses: weight = (score + 0.01) � 1 / (1 + num_children).

Domains and evaluation

A Domain defines your benchmark: load tasks, format input, evaluate predictions, and report aggregates. Evaluators in evaluators.py include static_evaluator, llm_judge_evaluator, and human_feedback_evaluator. The harness (harness.py) runs the TaskAgent over tasks.

The harness

flowchart LR
  subgraph perTask [Per task]
    fi[formatInput] --> fw[agent.forward]
    fw --> ev[domain.evaluate]
  end

Predictions vs scores

| | Score | Prediction | | --- | --- | --- | | What | Number from 0 to 1 | Model output string | | Typical files | report.json | predictions.json | | Used for | Parent selection, ranking | User-facing output, debugging |

Executors

executor.py provides LocalExecutor (fast, dev) and DockerExecutor (sandboxed via docker.py).

flowchart LR
  subgraph ex [Executor API]
    setup[setup]
    wd[getWorkdir]
    diff[diff]
    copyOut[copyOut]
    cleanup[cleanup]
  end

Self-referential improvement (`prompts_dir`)

Editable prompt files let the MetaAgent change its own instructions over generations:

`meta_agent.txt`
`task_agent.txt`

flowchart LR
  PD[prompts_dir] --> MT[meta_agent.txt]
  PD --> TT[task_agent.txt]
  MA[MetaAgent] -->|read and write| MT
  MA -->|read and write| TT

Early termination

Best archive score 1.0 stops the loop.
The MetaAgent receives score context so it avoids needless edits when already passing.

Examples overview

| Example | Focus | | --- | --- | | Bash | Command generation | | Calculator | Tool code fixes | | Fact-check | Classification |

See Examples for commands.

Glossary

| Term | Definition | | --- | --- | | Archive | JSONL history of generations and scores | | Domain | Task suite + evaluation | | Evaluator | static / LLM judge / human | | Executor | Local or Docker workspace per generation | | Harness | Runs TaskAgent over domain tasks | | MetaAgent | Edits code to improve TaskAgent | | Parent | Archive node used as base for a child | | Patch | Diff from MetaAgent |

Next steps