Work-boundaryengineering
AI is entering daily work as a drafter, analyst, reviewer, searcher, and tool user. The new problem is not only model quality. It is unmanaged delegation.
Toppy builds the evidence layer that defines what AI may do, when humans must intervene, what failures must be tested, and what remains reviewable after launch.
Current stage: public engineering proof, early commercial validation. The company is deliberately narrow: work boundaries for teams moving from AI features to AI-operated work.
Thesis
AI adoption creates new management work
When AI systems perform work, the hard part is not only model quality. A company needs boundaries for decisions, approvals, evidence, and tool authority. Most teams have not named this job yet.
Work ownership gets unclear
Teams ship AI-mediated work before naming who owns approvals, exceptions, and post-launch review.
Decisions cross invisible lines
Recommendations, drafts, tool calls, and approvals blur together unless the work boundary is explicit.
Evidence disappears
Human review becomes performative when the workflow does not state what gets tested, logged, or preserved.
Tools become authority
Automation becomes risky when file access, network calls, and retention are broader than the work actually needs.
New tasks
The new tasks of AI-operated work
AI does not only automate tasks. It creates new ones: delegation boundaries, failure evidence, escalation paths, permission limits, incident reconstruction, and visible human responsibility.
Delegation design
- Define what AI may do, suggest, execute, or hand back.
- Keep human responsibility visible when work crosses a boundary.
- Design escalation paths for exceptions, risky outputs, and failed checks.
Evidence operations
- Test failure modes before launch.
- Log the evidence needed for review after launch.
- Reconstruct incidents from inputs, tool calls, rules, and missed interventions.
Permission surface
- Restrict file, network, retention, and tool authority to the job.
- Gate dangerous actions instead of relying on optimistic prompts.
- Make tool authority narrow enough to audit.
Wedge
A new operating category: work boundary engineering
Our wedge is the boundary layer between AI systems and the work they are allowed to perform. It is narrower than governance, more operational than evaluation, and still underbuilt in ordinary AI adoption programs.
Why now
AI adoption moved faster than responsibility design. Large-enterprise governance is often too heavy, while small teams still need boundaries, logs, approvals, and tool rules they can operate.
Best-fit teams
- B2B SaaS teams moving from AI features to AI-operated workflows.
- Operators using LLMs internally without clear approval, evidence, or stop rules.
- Small technical organizations that need automation inside security-sensitive work.
Product path
We start as a productized service. We are learning the recurring work patterns of AI-operated teams. The long-term product is the evidence layer for delegated AI work.
What we are not
- Not an AI automation agency.
- Not a generic AI governance program.
- Not only model evaluation.
- Not prompt magic or dashboard theatre.
- Not risk hidden behind optimistic demos.
Proof
Proof artifacts as future-work capabilities
The public work is not a flat portfolio. It is evidence for the capabilities AI-operated work needs: verifiable outputs, reportable decisions, bounded tools, session-aware delegation, and public diagnostic signals.
Stage: public engineering proof, early commercial validation. Engineering evidence and market evidence are tracked separately.
Core proof: work-boundary engineering
YOLOZU
YOLOZU shows the ability to make AI outputs comparable and verifiable: define the output boundary once, validate it, then compare across frameworks without silent metric drift.
AuditLoop
AuditLoop shows the ability to make AI decisions reportable: what the system may do, what evidence must exist, and when humans take authority back.
Calibration and failure-mode work
Public research around overconfidence, filters, and large language models supports the boundary decisions behind client-facing reports.
Security surface proof
VSCode PDF Viewer Secure
VSCode PDF Viewer Secure shows the ability to design a narrow tool surface: offline-first, read-only by default, with restrictive browser behavior and explicit feature enablement.
Agents Secure Binding
Agents Secure Binding shows the ability to reason about agent delegation and session boundaries through a verifier-side acceptance profile for authority grants, holder proof, live sessions, freshness, attestation results, and local policy.
dmarc4all
DMARC4all shows the ability to turn public signals into operating evidence through privacy-first diagnostics and bounded data handling.
RFC quizzes
Spec-driven educational pages for HTTP, TLS, and QUIC show disciplined explanation, information design, and long-term maintenance.
Engineering proof vs. market proof
Engineering proof
Public releases, repository history, documentation, reproducible demos, security posture, and archival records are treated as technical evidence.
Market proof
Users, installs, inquiries, pilots, paid work, and repeat use are tracked separately so early traction is not overstated.
Current public metrics
- YOLOZU: PyPI latest 4.4.0, released 2026-04-29; 1,137 public GitHub commits; 2 stars and 0 forks.
- VSCode PDF Viewer Secure: 69 Visual Studio Marketplace installs and 0 ratings.
- Checked 2026-06-13. These are public product signals; market proof remains early.
Services
What you can hire Toppy for
Services are delivery formats for Work Boundary Engineering: make AI-operated work explicit enough to buy, finish, approve, log, test, and constrain.
AI Work Boundary Sprint
For a founder, product lead, or operator preparing one AI-operated workflow for launch.
Deliverables: boundary map, approval matrix, evidence-log template, stop rules, and unresolved-risk list.
- Before: the team knows the AI feature but not the operating boundary.
- After: the team knows what AI may do, what humans own, and what must be logged.
- Risks reduced: unclear ownership, silent overreach, missing review evidence.
- Trust artifacts: boundary map, failure cases, approval rules, evidence checklist.
Delegation Evidence Report
For teams already using LLMs internally or shipping agentic workflow features.
Deliverables: delegation rules, output evidence, failure modes, review criteria, audit-log requirements, and implementation priorities.
- Before: prompts, tools, reviewers, and approvals are mixed inside one workflow.
- After: each decision has an owner, a stop condition, and an evidence requirement.
- Risks reduced: confused-deputy behavior, audit gaps, approval theatre, scope drift.
- Trust artifacts: workflow map, test pack, reviewer policy, launch-risk note.
Agent Permission Surface Review
For teams that need a narrow internal tool or feature surface around AI-operated work.
Deliverables: permission boundary, network/file/data-retention limits, dangerous-action gates, release checklist, and scoped implementation patch when appropriate.
- Before: permissions, files, network calls, and retention are broader than the job.
- After: the tool surface matches the work boundary and fails closed by default.
- Risks reduced: data overexposure, unsafe tool execution, cache leakage, broad enablement.
- Trust artifacts: feature policy, security notes, release checklist, public proof when possible.
AI Incident Reconstruction
For teams that need to explain an AI output incident, bad recommendation, or missed intervention after it happened.
Deliverables: input timeline, referenced context, failed rule, human-intervention point, evidence gaps, and prevention checklist.
- Before: the team has an incident but not a clear account of how it happened.
- After: the team can explain what was seen, what was done, and where the boundary failed.
- Risks reduced: repeated failure, blame ambiguity, unreviewable logs, weak corrective action.
- Trust artifacts: reconstruction report, evidence timeline, missed-stop note, prevention checklist.
Sample artifact
Example: work-boundary record
A project output should be specific enough to operate. A lightweight record can show the allowed work, test evidence, escalation rule, tool limits, and unresolved risk in one place.
Product proof
Product proof details
These details support the proof hierarchy above. They show how work-boundary engineering becomes public software, security profiles, or a bounded managed service.
VSCode PDF Viewer Secure
Security-first, offline-first PDF viewing for Visual Studio Code. Built for environments that prefer conservative defaults, reduced attack surface, and explicit feature enablement.
Agents Secure Binding
Technical note and link hub for a session-bound Agent identity profile. The profile separates authority grants, holder proof, live session evidence, freshness, and local policy before the application treats a peer as the intended Agent.
dmarc4all
Managed DMARC/SPF/DKIM setup and reporting for teams that want a maintained public service rather than a custom consulting engagement.
YOLOZU
Framework-agnostic vision model evaluation toolkit centered on a stable predictions.json interface contract for detection, segmentation, keypoints, depth, and 6DoF pose.
Speculative Lab
Exploratory work kept separate from delivery
We keep exploratory work separate from client delivery, but public research shows the modeling discipline brought to ambiguous systems.
Thermo-Credit theory and dashboard
Exploratory economic research on credit dynamics, paired with a public theory note and dashboard. Dashboard readings should be understood as latest available by region, with data gaps treated as visible caveats.
Data freshness: current public report latest dates are JP 2025-03-31, EU 2020-03-31, and US 2025-03-31; checked 2026-06-13.
Kantian and control-theoretic work
Research artifacts remain labeled as research, not service claims. Their role is to show how Toppy reasons about uncertainty, overconfidence, and control boundaries.
Company
Who we serve and how we work
Who Toppy is for
ToppyMicroServices is built for founders, technical operators, and small product teams that need AI-operated work they can still inspect, explain, and control after launch.
The strongest fit is work-boundary design, responsibility mapping, evidence requirements, and bounded internal tool surfaces where accountability matters.
Founder / operator
Toppy is founder-led by Akira (thinksyncs), the public maintainer behind ToppyMicroServices artifacts across work-boundary engineering, security-bounded software surfaces, and exploratory reliability research.
The operating reason is simple: AI-performed work, human authority, and software boundaries should remain inspectable after launch. The site avoids unverifiable customer logos, certifications, or credentials.
Professional stance
We are an independent Estonia-based company focused on operating design for AI-performed work and security-bounded software delivery.
Exploratory research, public products, and client-facing services are labeled separately so the offer stays legible.
Contact
Design the work boundary
Tell us where AI is entering the work, what must remain human-owned, and which decisions need approval, evidence, logs, or software boundaries.