Recomputing large datasets on demand

Run large-scale recalculations when needed, without maintaining infrastructure for occasional workloads

Introduction

In many business systems, core data is not static. Changes in business rules, financial parameters, or regulations can require recalculating large volumes of existing data.

These recalculations are typically:

  • Computationally intensive
  • Time-sensitive
  • Infrequent, but critical

Traditional systems are not well suited for this pattern. They often rely on dedicated infrastructure that must remain available, even though it is only heavily used during limited periods.

Context: insurance and pension systems

This pattern is common in financial and insurance domains.

In this use case, a pension insurance system required recalculating all individual plans whenever new parameters were introduced for a given timeframe, such as indexation or profit conditions.

This meant:

  • Processing large datasets
  • Applying updated business rules consistently
  • Ensuring correctness across all contracts

The challenge of occasional heavy workloads

These recalculations do not happen continuously. They are triggered only when:

  • Financial parameters change
  • New business rules are introduced
  • Regulatory updates take effect

Traditional approaches typically rely on:

  • Dedicated batch servers
  • Fixed infrastructure sized for worst-case scenarios

This leads to:

  • High infrastructure costs
  • Low utilization outside recalculation periods
  • Limited flexibility when workloads increase

In practice, systems are either:

  • Overprovisioned (expensive most of the time), or
  • Too slow during critical recalculation windows

Additionally, running large recalculations in parallel can put significant pressure on underlying systems such as databases, leading to contention, throttling, or failures.

The redesign approach with Taskurai

Instead of relying on fixed infrastructure, recalculation workloads are offloaded to Taskurai as asynchronous, durable tasks.

Key principles:

  • Each recalculation unit is executed as a Taskurai task
  • Workers process tasks in parallel
  • Execution is fully asynchronous and durable
  • Compute capacity scales automatically based on demand

At the same time, Taskurai allows controlling how work is distributed and executed, ensuring that downstream systems such as databases are not overwhelmed.

This shifts the model from:

👉 Fixed infrastructure sized for worst-case
to
👉 On-demand execution with controlled throughput

Handling large-scale recalculations

With Taskurai:

  • Large datasets can be split into smaller tasks
  • Tasks are executed in parallel across multiple workers
  • Progress is tracked centrally
  • Failures can be retried or resumed automatically

In addition, execution can be tuned to balance load:

  • Control the number of concurrent workers
  • Limit throughput towards specific systems
  • Spread processing over time when needed

👉 This makes it possible to scale compute without overloading critical dependencies

During recalculation periods:

  • Workers scale up to handle the workload

Outside of these periods:

  • Workers scale down to zero

Operational visibility and control

Running large recalculations introduces operational risk, especially under time pressure.

Taskurai provides:

  • Centralized tracking of all recalculation tasks
  • Clear visibility into progress and completion
  • Detailed logs and failure diagnostics
  • The ability to retry or resume tasks without restarting the entire process

This improves:

  • Operational control during recalculation periods
  • Time to resolution when issues occur
  • Confidence in the correctness of large-scale processing

Business impact

Reduced infrastructure costs

  • No dedicated batch servers required
  • No idle infrastructure between recalculation periods
  • Lower operational and maintenance overhead

Faster processing with controlled load

  • Parallel execution reduces total processing time
  • Throughput can be tuned to avoid overloading databases and external systems
  • Large datasets can be processed safely within required timeframes

Improved flexibility

  • Easily adapt to new business rules or regulatory changes
  • Control execution speed based on system capacity
  • No dependency on fixed batch environments

Modernized architecture

  • Transition away from legacy batch processing systems
  • Easier to maintain and extend
  • Better integration with modern applications and services

Summary

By using Taskurai to execute large-scale recalculations on demand:

  • Heavy processing is handled efficiently
  • Infrastructure costs are reduced
  • Workloads scale dynamically with demand
  • Downstream systems remain protected through controlled execution

All without relying on dedicated systems that are only used occasionally.

Start building with Taskurai today!