Analysis of large language model capabilities, limitations, and practical applications. We examine prompt engineering, fine-tuning approaches, retrieval augmentation, and how to integrate LLMs into products that create genuine value.
17 articles

A clarifying primer that recasts "sovereign AI" not as a slogan but as control across four layers - data, compute, weights, governance - and shows that only a small, identifiable minority of enterprise workloads genuinely need more than the data layer existing tools already provide.

Most organisations measure LLM success using traditional software metrics whilst sitting on transformational cognitive infrastructure they barely understand how to evaluate properly.

Most organisations treat LLM security like traditional DevOps while ignoring novel attack vectors through model weights, training data, and prompt injection that conventional tools cannot detect.

GPT-4 struggles to count letters in "CharGPT" versus "ChatGPT" because tokenisation (the process of breaking text into processable units) fundamentally shapes what AI models can perceive, revealing why some companies' AI implementations fail at the architectural level rather than the reasoning level.

Most companies are burning money on LLM implementations by defaulting to expensive fine-tuning when sophisticated prompting could achieve comparable results at a fraction of the cost and complexity.

Despite their transformative potential, Large Language Models (LLMs) necessitate robust evaluation and strategic implementation to ensure they deliver real value rather than becoming a costly gamble.

QLoRA revolutionises the fine-tuning of large language models by combining quantisation and low-rank adaptation to significantly reduce memory usage while preserving performance, making advanced AI accessible to a broader range of users.

Small language models (SLMs), characterised by their efficiency and versatility, are emerging as pivotal tools for language processing, offering significant advantages in resource optimisation and accessibility, while challenging the dominance of larger models.

Enterprise knowledge graphs, enhanced by retrieval-augmented generation, are essential for transforming data silos into interconnected knowledge ecosystems, but their success hinges on data quality, scalability, security, and user-centric design.

Low-Rank Adaptation (LoRA) revolutionises the fine-tuning of large language models by enabling efficient model adaptation with minimal computational resources, while raising important ethical considerations.

A modern AI skills taxonomy is essential for building versatile teams that go beyond traditional data science to include advanced technical, interdisciplinary, and ethical competencies for future innovation.

Ollama enables local deployment of Large Language Models (LLMs), offering enhanced privacy, control, and efficiency for organisations seeking to harness the power of LLMs while maintaining oversight of their operational environment.

AI agents are intelligent systems that autonomously handle tasks, enhancing efficiency and reducing costs.

Process reward models (PRMs) train AI by providing feedback at each step of a task, enhancing understanding and problem-solving abilities.

Organizations considering building their own large language models (LLMs) should weigh the benefits of control and specialisation against challenges like high computational needs and expertise requirements.

By visualising the transformer as a dynamic conversation between human participants, we can grasp the core principles behind this influential neural network architecture.

A brief explanation of adversarial models and some potential use cases for them