- Lumiera
- Posts
- 🔆 The AI Cost Iceberg: Total Cost of Ownership (part 1 of 2)
🔆 The AI Cost Iceberg: Total Cost of Ownership (part 1 of 2)
Programmable biology, IP-friendly AI for creators, and a more complete picture of what it costs to implement AI successfully.
🗞️ Issue 58 // ⏱️ Read Time: 8 min
Hello 👋
When people talk about the cost of AI, there's often a focus on the headline-grabbing training costs of large models like Deepseek-R1 and GPT-4. However, the true cost of AI adoption is far more nuanced and complex. This week, we're breaking down the real numbers behind AI development, training, and implementation to help decision-makers better understand the financial impact of AI.
In this week's newsletter
What we’re talking about: A more comprehensive and realistic assessment of the total cost of ownership (TCO) of organizations who wish to build and deploy AI systems.
How it’s relevant: Most companies are not building foundational models, they’re integrating 3rd party AI tools into their products and services. With most discussions around AI costs focusing on initial training and inference, organizations are left with a skewed understanding of how much to budget for their AI transformation.
Why it matters: Miscalculating or overlooking the TCO of AI implementation can severely disrupt an organization's strategic initiatives including forced compromises, delayed innovation, or strained resources across departments.
Big tech news of the week…
🎨 Adobe has launched its Firefly Video Model in public beta. Firefly is only trained on content that Adobe has permission to use, which includes licensed content from Adobe Stock and public domain content, making it the only generative AI video model that is IP (Intellectual Property)-friendly and commercially safe.
⚖️ The first U.S. ruling on fair use in AI-related copyright litigation has been made. A federal judge in Delaware has ruled that Thomson Reuters' competitor, Ross Intelligence, violated copyright law by using Thomson Reuters' content to build a competing AI-based legal platform. The ruling is seen as a significant victory for content owners in their ongoing legal battles against AI companies, potentially setting a precedent for future cases involving AI and copyright infringement
🧬 Latent Labs, a new startup founded by a former Google DeepMind scientist, launches with $50M in funding to make biology programmable. They aim to enable researchers to “computationally create” new therapeutic molecules from scratch, upending the current drug-discovery process.
🇬🇧The UK’s AI Safety Institute becomes AI Security Institute to “reflect its focus on serious AI risks with security implications, such as how the technology can be used to develop chemical and biological weapons, how it can be used to carry out cyber-attacks, and enable crimes such as fraud and child sexual abuse.”
AI Total Cost of Ownership
For most organizations, the cost of AI systems doesn't end when a foundational model is released into general availability; that's where it begins. Total cost of ownership (TCO) in the AI context goes beyond the traditional definition of purchase and deployment costs. It encompasses the entire journey of AI adoption: from initial acquisition and customization, through ongoing operations and maintenance, to the eventual evolution or retirement of the system.
Unlike traditional IT systems, AI TCO is characterized by unique challenges:
the need for advanced hardware infrastructure
specialized expertise
continuous model updates
data quality management, and
cultural transformation.
As organizations rush to adopt AI, understanding these comprehensive costs becomes crucial for sustainable implementation and long-term success.
Today, we’ll discuss the cost considerations of build vs. buy, pay-as-you-go, and customization. Other key components like Design Thinking and Cost of Culture will be covered in next week’s newsletter, making this newsletter Part 1 out of 2. Let’s get into it!
The Build v. Buy Decision
One of the major strategic decisions organizations face when implementing AI is whether to build custom solutions or buy existing ones. This choice fundamentally shapes the TCO. In 2025, most organizations are purchasing AI in the form of closed-source foundation models and either using them off the shelf or customizing them to their needs. While buying existing solutions typically has lower upfront costs, organizations must carefully consider whether or not to rely on closed-source models. These models can offer stronger performance but also create dependency on vendor pricing and policies.
OpenAI, Anthropic, and Google lead the pack, but a few others are not far behind, including Meta’s Llama as an open-source option. Survey results indicate that very few (<5%) stick to one model vendor, and the average company has 2.3 model vendors in production, in an attempt to stay flexible.
The hybrid approach of using multiple vendors simultaneously does reduce dependency risks and allows you to leverage different models' strengths. However, it also introduces its own costs in terms of integration complexity and vendor management.
Regardless of whether you build or buy, you need to budget for integration, customization, maintenance, monitoring, and regular updates to keep systems performing optimally.
Pay-as-you-Go
Cloud computing popularized and expanded the use of the Pay-As-You-Go (PAYG) pricing model in the technology sector. AI API providers have widely adopted this model, and many companies are opting for cloud-based AI services to avoid large hardware investments.
Anthropic’s Claude 3.5 Sonnet API
Anthropic offers a PAYG model for its Claude API:
Claude 3.5 Sonnet: $3 / 1M input tokens and $15 / 1M output tokens
Claude 3.5 Haiku: $.80 / 1M input tokens and $4 / 1M output tokens
Additional services like latency optimization for faster responses are priced separately.

Check out this tokenizer tool from OpenAI to get a feeling for how text translates into tokens.
🤖 Real World Example - Code Assistance Tool
A development tool using Claude 3.5 Sonnet (the leading model in coding proficiency) for code suggestions and explanations, handling 5,000 requests per day. Each request uses 500 input tokens and 1,000 output tokens:
Input: $7.50 | Output: $75.00
Total daily cost: $82.50
Monthly cost (30 days): $2,475
📝 5,000 tokens per day is on the lower end of usage for a business code assistance tool. Many companies are likely to use significantly more, especially for larger development teams or more complex projects.
Try this calculator to compare costs for sample text/code across models.
The PAYG model allows businesses to scale their AI usage as needed without large upfront investments. However, it's crucial to carefully monitor and optimize usage to prevent unexpected cost escalations as applications grow.
How is your organization budgeting for AI transformation?
Customization Costs
Foundation models like GPT-4 and Claude are trained on massive datasets collected from across the internet. This broad training helps them recognize patterns in everything from scientific papers to social media posts. While this extensive training makes these models versatile, the real costs begin when organizations need to adapt and customize them for specific business needs.
There are a few ways to customize a model for your given use case:
a) Prompt Engineering involves crafting specific inputs to guide the model's output without modifying its parameters. Costs include:
Skilled prompt engineers
Continuous optimization
Documentation management
Staff training
Regular updates as models evolve
b) Retrieval Augmented Generation (RAG) enhances LLM responses by referencing external, authoritative knowledge bases. Costs include:
Document processing infrastructure
Vector database management
Knowledge base maintenance
Continuous data updates
Integration with existing systems
Additional compute costs for retrieval
c) Fine-tuning involves adjusting a pre-trained model's parameters using task-specific data. Costs include:
Computing resources for model adaptation
Data cleaning and preparation (a top cost driver)
Testing and validation cycles
Performance monitoring
Regular updates and optimization
Subject matter expert time (10.4% of total costs)
📣 Recent research shows that data preparation alone accounts for 13.2% of AI implementation costs, with companies often underestimating the complexity of making these powerful but general-purpose models work for specialized tasks.
Beyond the Tech
Realizing the potential of AI requires a clear understanding of the full investment required. Organizations that thrive in this new era take a structured, deliberate approach to AI adoption - one that acknowledges not just the obvious technical costs, but the essential investments in design, culture, and human capability.
Success requires sustained investment in design thinking and cultural change. In Part 2 of this series, we’ll go further into the costs associated with organizational readiness. Stay tuned!
Until next time.
On behalf of Team Lumiera
Lumiera has gathered the brightest people from the technology and policy sectors to give you top-quality advice so you can navigate the new AI Era.
Follow the carefully curated Lumiera podcast playlist to stay informed and challenged on all things AI.
What did you think of today's newsletter? |

Disclaimer: Lumiera is not a registered investment, legal, or tax advisor, or a broker/dealer. All investment/financial opinions expressed by Lumiera and its authors are for informational purposes only, and do not constitute or imply an endorsement of any third party's products or services. Information was obtained from third-party sources, which we believe to be reliable but not guaranteed for accuracy or completeness. |