Research foundations

Engineered from proven research, not improvised from a tutorial.

CODITECT begins with research. Every architectural decision in the platform is grounded in peer-reviewed work, regulatory frameworks, or well-established engineering patterns. AZ1.AI Inc. evolves on the frontier of AI technology as rapidly as the research is proven, integrating, improving, and evolving the absolute best and proven ideas for the benefit of every customer. The bibliography below cites the specific work behind every key decision.

How we work

Every CODITECT design choice answers a question of the form "what does the literature say is the best known solution to X". We do not adopt patterns because they are popular on social media or because a single engineering team blogged about them. We adopt patterns because the research community, regulatory bodies, or longitudinal industry experience has converged on them as the durable answer. Where the frontier is still moving (long-context retrieval, agent orchestration, model routing), we track the evidence and integrate as soon as a result is reproduced and stable.

The bibliography below is grouped by the architectural area each citation supports. Each entry cites the primary source in Chicago author-date format and notes the specific CODITECT decision it informs.

AI Harness, agent orchestration, foundation models

  • Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. "Attention Is All You Need." In Advances in Neural Information Processing Systems 30. Supports: the foundational architecture that makes today's foundation models possible. CODITECT's harness wraps these models without modifying them.
  • Yao, Shunyu, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, and Yuan Cao. 2022. "ReAct: Synergizing Reasoning and Acting in Language Models." arXiv:2210.03629. Supports: the reason-then-act loop CODITECT Blueprints implement deterministically.
  • Schick, Timo, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Luke Zettlemoyer, Nicola Cancedda, and Thomas Scialom. 2023. "Toolformer: Language Models Can Teach Themselves to Use Tools." In Advances in Neural Information Processing Systems 36. Supports: tool-call orchestration as a learned competency, the basis for CODITECT's MCP and Agent SDK integration.
  • Park, Joon Sung, Joseph C. O'Brien, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, and Michael S. Bernstein. 2023. "Generative Agents: Interactive Simulacra of Human Behavior." In UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology. Supports: persistent memory, reflection, and planning loops in multi-agent systems - the structural pattern Blueprints externalise into a deterministic harness.
  • Wang, Lei, Chen Ma, Xueyang Feng, Zeyu Zhang, Hao Yang, Jingsen Zhang, Zhiyuan Chen, et al. 2024. "A Survey on Large Language Model Based Autonomous Agents." Frontiers of Computer Science 18 (6): 186345. Supports: the taxonomy of agent architectures CODITECT positions itself within - foundation-model-agnostic harness rather than agent framework.
  • Anthropic. 2024. "Claude 3 Model Card and Evaluations." Technical report. Supports: the choice of Claude as primary execution model in CODITECT today, on long-context reasoning benchmarks.
  • Kaplan, Jared, Sam McCandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, Scott Gray, Alec Radford, Jeffrey Wu, and Dario Amodei. 2020. "Scaling Laws for Neural Language Models." arXiv:2001.08361. Supports: the rationale for foundation-model-agnostic routing - the leader changes as scaling continues, so the harness must abstract the model.

Context engineering, retrieval, memory

  • Lewis, Patrick, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, et al. 2020. "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks." In Advances in Neural Information Processing Systems 33. Supports: RAG as the canonical pattern for grounding model output in project memory - CODITECT's approach to per-project semantic context.
  • Karpukhin, Vladimir, Barlas Oğuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, and Wen-tau Yih. 2020. "Dense Passage Retrieval for Open-Domain Question Answering." In EMNLP 2020. Supports: dense embedding retrieval, the algorithmic basis of CODITECT's semantic search index across the audit trail.
  • Gao, Yunfan, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, and Haofen Wang. 2024. "Retrieval-Augmented Generation for Large Language Models: A Survey." arXiv:2312.10997. Supports: current best practice for hybrid structured-plus-semantic retrieval; CODITECT's query layer follows the survey's "Modular RAG" pattern.
  • Helland, Pat. 2007. "Immutability Changes Everything." In Proceedings of the Conference on Innovative Data Systems Research (CIDR). Supports: append-only data design as a first-class architectural choice - the basis for CODITECT's audit-trail tables.
  • Fowler, Martin. 2005. "Event Sourcing." martinfowler.com. Supports: the recorded-outcome design - state derived from an immutable log of events rather than overwriting rows.

Test selection, regression testing, automated test generation

  • Rothermel, Gregg, and Mary Jean Harrold. 1996. "Analyzing Regression Test Selection Techniques." IEEE Transactions on Software Engineering 22 (8): 529-51. Supports: the foundational treatment of safe test selection. CODITECT's "which tests must run for this change" answers Mark's first deficiency using this framework.
  • Yoo, Shin, and Mark Harman. 2012. "Regression Testing Minimization, Selection and Prioritization: A Survey." Software Testing, Verification and Reliability 22 (2): 67-120. Supports: the modern taxonomy of test-impact analysis CODITECT implements per change.
  • Memon, Atif, Zebao Gao, Bao Nguyen, Sanjeev Dhanda, Eric Nickell, Rob Siemborski, and John Micco. 2017. "Taming Google-Scale Continuous Testing." In ICSE-SEIP '17: Proceedings of the 39th International Conference on Software Engineering: Software Engineering in Practice Track. Supports: the scaling argument for selection - running every test on every commit is infeasible at any meaningful organisation. CODITECT's per-change selection is the only sustainable approach.
  • Schäfer, Max, Sarah Nadi, Aryaz Eghbali, and Frank Tip. 2024. "An Empirical Evaluation of Using Large Language Models for Automated Unit Test Generation." IEEE Transactions on Software Engineering 50 (1): 85-105. Supports: LLM-driven test generation as a viable, evaluated approach - the basis for CODITECT's edge-case test generation in BP-004.
  • Lemieux, Caroline, Jeevana Priya Inala, Shuvendu K. Lahiri, and Siddhartha Sen. 2023. "CODAMOSA: Escaping Coverage Plateaus in Test Generation with Pre-trained Large Language Models." In ICSE 2023. Supports: hybrid SBST + LLM test generation for hard-to-reach branches - the technique CODITECT applies when coverage probes flag a gap.

Software engineering practice: TDD, ADRs, design records

  • Beck, Kent. 2002. Test Driven Development: By Example. Boston: Addison-Wesley. Supports: the Red-Green-Refactor cycle CODITECT records and enforces.
  • Nygard, Michael T. 2011. "Documenting Architecture Decisions." Cognitect blog, November 15, 2011. Supports: the ADR format CODITECT uses as its canonical decision record.
  • Fowler, Martin. 2018. "Architectural Decision Records." martinfowler.com. Supports: ADR adoption at scale and the relationship between ADRs and ongoing change management.
  • Booch, Grady, Robert A. Maksimchuk, Michael W. Engle, Bobbi J. Young, Jim Conallen, and Kelli A. Houston. 2007. Object-Oriented Analysis and Design with Applications. 3rd ed. Boston: Addison-Wesley. Supports: SDD content discipline and decomposition principles.
  • Brooks, Frederick P., Jr. 1995. The Mythical Man-Month: Essays on Software Engineering. Anniversary ed. Reading, MA: Addison-Wesley. Supports: the analytical case that small, atomic, well-recorded change units beat big-bang releases - the structural principle behind Blueprints.

Audit trails, immutability, and signed records

  • Nakamoto, Satoshi. 2008. "Bitcoin: A Peer-to-Peer Electronic Cash System." Whitepaper. Supports: the canonical demonstration that hash-chained, append-only ledgers can resist mutation under adversarial conditions. The pattern - not the cryptocurrency - informs CODITECT's audit chain design.
  • Vogels, Werner. 2009. "Eventually Consistent." Communications of the ACM 52 (1): 40-44. Supports: the consistency model behind CODITECT's local-to-cloud sync design.
  • Merkle, Ralph C. 1988. "A Digital Signature Based on a Conventional Encryption Function." In Advances in Cryptology - CRYPTO '87, edited by Carl Pomerance, 369-78. Lecture Notes in Computer Science 293. Berlin: Springer. Supports: Merkle-tree audit-log structures for tamper-evidence at scale.
  • Schneier, Bruce, and John Kelsey. 1999. "Secure Audit Logs to Support Computer Forensics." ACM Transactions on Information and System Security 2 (2): 159-76. Supports: the threat model and remediation pattern for audit logs that must remain trustworthy after a host compromise.

AI governance, regulatory frameworks, validated software

  • National Institute of Standards and Technology. 2023. Artificial Intelligence Risk Management Framework (AI RMF 1.0). NIST AI 100-1. Gaithersburg, MD: NIST. Supports: the GOVERN/MAP/MEASURE/MANAGE function model CODITECT maps to control primitives on /compliance.
  • European Parliament and Council. 2024. Regulation (EU) 2024/1689 of 13 June 2024 Laying Down Harmonised Rules on Artificial Intelligence (Artificial Intelligence Act). Official Journal of the European Union L 2024/1689. Supports: the regulatory requirements for high-risk AI systems CODITECT addresses through Articles 9-15, 17, and 61.
  • International Organization for Standardization. 2023. ISO/IEC 42001:2023 - Information Technology - Artificial Intelligence - Management System. Geneva: ISO. Supports: the AI management-system clauses (5-10) CODITECT operates against.
  • International Organization for Standardization. 2022. ISO/IEC 27001:2022 - Information Security, Cybersecurity and Privacy Protection - Information Security Management Systems - Requirements. Geneva: ISO. Supports: the security control set CODITECT inherits and exposes (encryption, access control, audit).
  • National Association of Insurance Commissioners. 2023. Model Bulletin on the Use of Artificial Intelligence Systems by Insurers. December 4, 2023. Supports: insurance-specific AI governance requirements addressed on /compliance.
  • New York State Department of Financial Services. 2023. 23 NYCRR Part 500 - Cybersecurity Requirements for Financial Services Companies (as amended). Albany, NY: NYDFS. Supports: 72-hour incident reporting and AI-system cybersecurity requirements for the financial-services ICP.
  • U.S. Food and Drug Administration. 2023. 21 CFR Part 11: Electronic Records; Electronic Signatures. Rockville, MD: FDA. Supports: trustworthy electronic records and electronic-signature requirements - CODITECT's HMAC-signed, append-only audit tables address sections 11.10 and 11.50.
  • International Medical Device Regulators Forum. 2014. Software as a Medical Device (SaMD): Possible Framework for Risk Categorization and Corresponding Considerations. IMDRF/SaMD WG/N12FINAL:2014. Supports: the SaMD risk-categorisation framework CODITECT validates against (per ADR-292).
  • International Electrotechnical Commission. 2015. IEC 62304:2006/A1:2015 - Medical Device Software - Software Life Cycle Processes. Geneva: IEC. Supports: the SaMD software lifecycle controls CODITECT's QVS implements.
  • U.S. Food and Drug Administration. 2025. Computer Software Assurance for Production and Quality System Software: Guidance for Industry and Food and Drug Administration Staff. Final guidance, September 2025. Supports: risk-based validation that explicitly endorses automated testing as primary evidence - the regulatory tailwind behind CODITECT's QVS approach (cited in ADR-292).
  • International Society for Pharmaceutical Engineering. 2022. GAMP 5: A Risk-Based Approach to Compliant GxP Computerized Systems. 2nd ed. Tampa, FL: ISPE. Supports: the GAMP 5 Category 5 IQ/OQ/PQ qualification pattern CODITECT generates as audit byproducts.
  • U.S. Department of Health and Human Services. 2003. HIPAA Security Rule, 45 CFR Parts 160, 162, and 164. Supports: the safeguards (administrative, physical, technical) CODITECT applies to PHI in healthcare deployments.

Quality management, ITIL, and process frameworks

  • AXELOS. 2019. ITIL 4 Foundation. Edition 2019. Norwich, UK: TSO (The Stationery Office). Supports: the four ITIL practices CODITECT implements as first-class records: incident, problem, change, configuration management.
  • International Organization for Standardization. 2015. ISO 9001:2015 - Quality Management Systems - Requirements. Geneva: ISO. Supports: the QMS structural requirements CODITECT inherits at the platform layer.
  • International Council for Harmonisation. 2008. ICH Harmonised Tripartite Guideline: Pharmaceutical Quality System Q10. Geneva: ICH. Supports: the CAPA process and deviation-management discipline CODITECT brings into the audit trail.
  • Deming, W. Edwards. 1986. Out of the Crisis. Cambridge, MA: MIT Press. Supports: the Plan-Do-Study-Act cycle behind CODITECT's iteration loop.

Multi-tenant architecture and SaaS engineering

  • Bezemer, Cor-Paul, and Andy Zaidman. 2010. "Multi-tenant SaaS Applications: Maintenance Dream or Nightmare?" In Proceedings of the Joint ERCIM Workshop on Software Evolution (EVOL) and International Workshop on Principles of Software Evolution (IWPSE), 88-92. Supports: the analytical foundation for tenant-isolated row-level security CODITECT applies on cloud Postgres SSOT (per ADR-301).
  • Krebs, Rouven, Christof Momm, and Samuel Kounev. 2012. "Architectural Concerns in Multi-tenant SaaS Applications." In CLOSER 2012. Supports: the isolation, performance, and customisation taxonomy informing CODITECT's tenant model.

AI safety, alignment, and harness research

  • Bai, Yuntao, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, et al. 2022. "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback." arXiv:2204.05862. Supports: RLHF as the alignment mechanism behind the foundation models CODITECT routes to.
  • Bai, Yuntao, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, et al. 2022. "Constitutional AI: Harmlessness from AI Feedback." arXiv:2212.08073. Supports: Anthropic's Constitutional AI approach - the harmlessness foundation of Claude, CODITECT's primary execution model today.
  • Hendrycks, Dan, Mantas Mazeika, and Thomas Woodside. 2023. "An Overview of Catastrophic AI Risks." arXiv:2306.12001. Supports: the threat-model framework that motivates rigorous audit trails and deterministic agent execution.
  • Anthropic. 2024. "Engineering Best Practices for Building Agents with Claude." Engineering blog, October 2024. Supports: the AI harness architectural principles published by the Claude team that CODITECT's harness implements.

How to read this bibliography

Citations are in Chicago author-date style. Standards bodies and regulations are cited under the issuing organisation. Each entry's annotation names the specific CODITECT decision the source informs - so a reviewer can trace any architectural choice back to its evidence in under a minute.

The list is not exhaustive. It is the working set behind CODITECT's published ADRs and the platform's current build. New citations are added as new architectural decisions are recorded; obsolete ones are pruned when a superseding result is integrated.

If you want to verify a specific decision, the platform's ADR set is the primary evidence; this page is the link from the ADR to the underlying research.