Dzone

Building Intelligent Microservices With Go and AWS AI Services

Coupling Go's lightweight programming capabilities with AWS' robust AI services allows developers to build performant, scalable, and intelligent microservices devoted to diverse business needs. This blog explains how Go, and AWS AI services can be combined to create intelligent microservices, discus ...

Data Privacy and Governance in Real-Time Data Streaming

Real-time data streaming is changing the way organizations handle information. Whether it’s IoT devices sending sensor updates, retail platforms tracking customer activity, or financial institutions monitoring transactions for fraud, processing data “as it happens” gives you a major edge. When done ...

Dive Into Tokenization, Attention, and Key-Value Caching

The Rise of LLMs and the Need for Efficiency In recent years, large language models (LLMs) such as GPT, Llama, and Mistral have impacted natural language understanding and generation. However, a significant challenge in deploying these models lies in optimizing their performance, particularly for ta ...

Creating an Agentic RAG for Text-to-SQL Applications

The blend of retrieval-augmented generation (RAG) and generative AI models has brought changes to natural language processing by improving the responses to queries. In the realm of Agentic RAG, this conventional method of relying on a monolithic model for tasks has been enhanced by introducing modul ...

Implement Amazon S3 Cross-Region Replication With Terraform

With the information technology element finding its roots in every financial organization and across all industries, strong storage capacity forms the backbone for availability, durability, and scalability. Among these, Amazon S3 is one of the most popular services to meet these needs. As enterprise ...

Container Checkpointing in Kubernetes With a Custom API

Problem Statement Challenge Organizations running containerized applications in Kubernetes often need to capture and preserve the state of running containers for: Disaster recovery Application migration Debug/troubleshooting State preservation Environment reproduction However, there's no stra ...

The Delegated Chain of Thought Architecture

This article introduces the Delegated Chain of Thought (D-CoT) Architecture, a novel framework for large language models (LLMs) that decouples reasoning from execution.The architecture centralises reasoning in a "modulith" model while delegating execution tasks to smaller, specialised models.  ...

Generate Unit Tests With AI Using Ollama and Spring Boot

There are scenarios where we would not want to use commercial large language models (LLMs) because the queries and data would go into the public domain. There are ways to run open-source LLMs locally. This article explores the option of running Ollama locally interfaced with the Sprint boot applicat ...

Multimodal RAG With Colpali, Milvus, and VLMs

In this post, we will see how to do multimodal RAG with ColPali, Milvus, and a visual language model (Gemini/GPT-4o). We will build an application to upload a PDF and then do Q&A queries on it. Q&A can be done on both text and visual elements of the PDF. We will not extract text from the PDF ...

A Step-by-Step Guide to Enterprise Application Development

Having spent more late nights untangling enterprise spaghetti code than I care to admit, I can confidently say developing enterprise applications is not for the faint of heart. While hobby apps crash because someone forgot a semicolon, enterprise code glitches could mean accidentally buying every em ...

Search: From Basic Document Retrieval to Answer Generation

In the digital age, the ability to find relevant information quickly and accurately has become increasingly critical. From simple web searches to complex enterprise knowledge management systems, search technology has evolved dramatically to meet growing demands. This article explores the journey fro ...

Understanding Infrastructure as Code at Scale

Infrastructural as Code is an IT approach where the process involved in infrastructure configuration, deployment, and monitoring are checked through coded expressions. IaC also increases the deployment rate and is significant when applied with cloud resources.  "IaC at scale" means using this a ...

The Slides of Hands-On Agile 2025

If you weren’t at the virtual Hands-on Agile 2025 conference earlier this month, you missed an incredible opportunity to explore the shift from concept-based to context-based agility with nearly 800 fellow agilists. But don’t worry — I’m here to share some of the key takeaways and insights! Check ou ...

Integrating Apex With Lightning Web Components

Salesforce’s Lightning Web Component (LWC) is a modern UI framework that developers use to create custom pages and functionalities on the Salesforce Lightning platform. While LWC allows developers to build powerful and interactive user interfaces, it often requires back-end integration to fetch or u ...

Scaling ML Models Efficiently With Shared Neural Networks

As machine learning models grow in complexity and size, organizations face increasing challenges in deploying and scaling these models efficiently. A particularly pressing challenge is balancing hardware memory constraints with the expanding size of ML models while maintaining high performance and c ...

Creating a Web Project: Key Steps to Identify Issues

When developing a product, issues inevitably arise that can impact both its performance and stability. Slow system response times, error rate increases, bugs, and failed updates can all damage the reputation and efficiency of your project.  However, before addressing these problems, it is essen ...

Have LLMs Solved the Search Problem?

The advent of large language models (LLMs) has catalyzed a paradigm shift in information retrieval and human-computer interaction. These models, trained on vast corpora of text and optimized for predictive linguistic tasks, have demonstrated substantial efficacy in responding to queries, summarizing ...

Apache Flink: Full Checkpoint vs Incremental Checkpoint

Apache Flink is a real-time data stream processing engine. Most of the stream processing applications are ‘stateful.’ This means the state is stored and used for further processing. In Apache Flink, the state is managed through a configured state backend. Flink supports two-state backends in product ...

Page Transactions and Page Object Model

Recently, I wrote an article on Page Transactions as a new approach to test automation, focusing on simplicity, readability, and flexibility. Now, I present a structured comparison of Page Transactions (PT) and Page Object Model (POM), explaining the pros and cons of each pattern. Finally, I wi ...

Build a Data Analytics Platform With Flask, SQL, and Redis

In this article, I’ll walk through the development of a Flask-based web application that interacts with an SQL Server database to analyze population data. The application allows users to query population ranges, fetch counties by state, and retrieve states within specific population ranges. I shall ...

From Data to Decisions: Visualizing SAP Insights With Python

SAP has always enjoyed the position of being the most trusted enterprise resource planning software where it brings in all the departments in an organization like sales, inventory, production, quality, human resources, and more. Data that resides in SAP ERP data is the bedrock for all decision-makin ...

Why Cloud Matters: Building Global, Scalable Microservices

Software engineers must develop applications that are not only functional but also scalable, resilient, and globally distributed. This is where cloud computing plays a crucial role. Cloud platforms provide the foundation to build scalable microservices, ensuring high availability, efficient resource ...

Reactive Programming in React With RxJS

ReactJS has become a go-to library for building dynamic and responsive user interfaces. However, as applications grow, managing asynchronous data streams becomes more challenging. Enter RxJS, a powerful library for reactive programming using observables. RxJS operators simplify handling complex asyn ...

Redis as a Primary Database for Complex Applications

First, we will see what Redis is and its usage, as well as why it is suitable for modern complex microservice applications. We will talk about how Redis supports storing multiple data formats for different purposes through its modules. Next, we will see how Redis, as an in-memory database, can per ...

Optimizing Database Performance in Middleware Applications

In the realm of modern software architecture, middleware plays a pivotal role in connecting various components of distributed systems. One of the most significant challenges faced by middleware applications is optimizing database interactions. This is crucial because middleware often serves as the b ...

Robust Integration Solutions With Apache Camel and Spring Boot

In today’s interconnected world, integrating systems, applications, and data is a critical requirement for businesses. However, building reliable and scalable integration solutions can be challenging due to the complexity of handling different protocols, data formats, and error scenarios. Apache Cam ...

ETL Generation Using GenAI

Generating ETL data pipelines using generative AI (GenAI) involves leveraging the capabilities of large language models to automatically create the code and logic for extracting, transforming, and loading data from various sources, significantly reducing manual coding efforts and accelerating pipeli ...

gRPC and Its Role in Microservices Communication

gRPC (gRPC Remote Procedure Calls) is a modern, open-source, high-performance RPC (Remote Procedure Call) framework developed by Google. It is designed to facilitate efficient communication between distributed systems, making it particularly well-suited for microservices architectures. Below is an e ...

Unlocking Local AI: Build RAG Apps Without Cloud or API Keys

Retrieval-augmented generation (RAG) is transforming how we interact with AI models by combining retrieval techniques with generative models. But what if you could build RAG applications locally, without API keys or cloud dependencies? Let's meet Chipper, an open-source framework that makes building ...

Git Tags vs AWS Tags: A Tag-Tastic Showdown!

Welcome to the ultimate showdown between Git Tags and AWS Tags, two contenders with nothing in common except their love for the word "tag." Let's put them head-to-head and see who wins the battle of confusion and utility! Round 1: Who They Are Git Tags The historians of the software world. They don' ...