Recomputing large datasets on demand
Run large-scale recalculations when needed, without maintaining infrastructure for occasional workloads
Introduction
In many business systems, core data is not static. Changes in business rules, financial parameters, or regulations can require recalculating large volumes of existing data.
These recalculations are typically:
- Computationally intensive
- Time-sensitive
- Infrequent, but critical
Traditional systems are not well suited for this pattern. They often rely on dedicated infrastructure that must remain available, even though it is only heavily used during limited periods.
Context: insurance and pension systems
This pattern is common in financial and insurance domains.
In this use case, a pension insurance system required recalculating all individual plans whenever new parameters were introduced for a given timeframe, such as indexation or profit conditions.
This meant:
- Processing large datasets
- Applying updated business rules consistently
- Ensuring correctness across all contracts
The challenge of occasional heavy workloads
These recalculations do not happen continuously. They are triggered only when:
- Financial parameters change
- New business rules are introduced
- Regulatory updates take effect
Traditional approaches typically rely on:
- Dedicated batch servers
- Fixed infrastructure sized for worst-case scenarios
This leads to:
- High infrastructure costs
- Low utilization outside recalculation periods
- Limited flexibility when workloads increase
In practice, systems are either:
- Overprovisioned (expensive most of the time), or
- Too slow during critical recalculation windows
Additionally, running large recalculations in parallel can put significant pressure on underlying systems such as databases, leading to contention, throttling, or failures.
The redesign approach with Taskurai
Instead of relying on fixed infrastructure, recalculation workloads are offloaded to Taskurai as asynchronous, durable tasks.
Key principles:
- Each recalculation unit is executed as a Taskurai task
- Workers process tasks in parallel
- Execution is fully asynchronous and durable
- Compute capacity scales automatically based on demand
At the same time, Taskurai allows controlling how work is distributed and executed, ensuring that downstream systems such as databases are not overwhelmed.
This shifts the model from:
👉 Fixed infrastructure sized for worst-case
to
👉 On-demand execution with controlled throughput
Handling large-scale recalculations
With Taskurai:
- Large datasets can be split into smaller tasks
- Tasks are executed in parallel across multiple workers
- Progress is tracked centrally
- Failures can be retried or resumed automatically
In addition, execution can be tuned to balance load:
- Control the number of concurrent workers
- Limit throughput towards specific systems
- Spread processing over time when needed
👉 This makes it possible to scale compute without overloading critical dependencies
During recalculation periods:
- Workers scale up to handle the workload
Outside of these periods:
- Workers scale down to zero
Operational visibility and control
Running large recalculations introduces operational risk, especially under time pressure.
Taskurai provides:
- Centralized tracking of all recalculation tasks
- Clear visibility into progress and completion
- Detailed logs and failure diagnostics
- The ability to retry or resume tasks without restarting the entire process
This improves:
- Operational control during recalculation periods
- Time to resolution when issues occur
- Confidence in the correctness of large-scale processing
Business impact
Reduced infrastructure costs
- No dedicated batch servers required
- No idle infrastructure between recalculation periods
- Lower operational and maintenance overhead
Faster processing with controlled load
- Parallel execution reduces total processing time
- Throughput can be tuned to avoid overloading databases and external systems
- Large datasets can be processed safely within required timeframes
Improved flexibility
- Easily adapt to new business rules or regulatory changes
- Control execution speed based on system capacity
- No dependency on fixed batch environments
Modernized architecture
- Transition away from legacy batch processing systems
- Easier to maintain and extend
- Better integration with modern applications and services
Summary
By using Taskurai to execute large-scale recalculations on demand:
- Heavy processing is handled efficiently
- Infrastructure costs are reduced
- Workloads scale dynamically with demand
- Downstream systems remain protected through controlled execution
All without relying on dedicated systems that are only used occasionally.
