Digital Twin Scalability Dataset

What it is

A stress test, recorded

This is the sibling project to the Robot-Arm Digital Twin. Where that one was the system, this is the measurement: a dataset captured while progressively spinning up more robot-manipulator instances against the same edge-based digital-twin service, watching how resource use and responsiveness degrade as load climbs.

It comes in three flavours that differ only in how aggressively new robots arrive — and a labelled version exists to train a classifier, which is where the project stops being "a CSV" and becomes a question about intelligent autoscaling.

60s

Micro dataset
+1 robot / minute

300s

Small dataset
+1 robot / 5 min

3600s

Big dataset
+1 robot / hour

The stack

From telemetry to decision

subject

Edge Digital Twin

The robot-arm twin service under test — the thing being scaled and measured.

data

Time-series telemetry

Resource and performance metrics logged as robot instances are added at fixed intervals.

Random Forest

A classifier trained on the labelled big dataset — an ensemble of decision trees that votes on the system's state.

objective

SLA management

Service-Level Agreements set the threshold; the model predicts breaches so scaling can act in time.

action

Autoscaling

The payoff: scale the service up/down based on a prediction, not a lagging alarm.

context

AIML-as-a-Service

The same prediction can be offered as a network service — autoscaling intelligence delivered on demand rather than hand-tuned.

How it works

The experiment loop

Define the arrival rate
Choose micro / small / big — i.e. how fast new robot instances join the load.
Scale & record
Spin up instances on schedule, logging resource and timing metrics throughout the run.
Label the states
Tag samples (e.g. SLA-met vs. at-risk) to create the supervised target for the big dataset.
Train the classifier
Fit a Random Forest to map live telemetry to a predicted SLA state.
Predict & scale
In deployment, the model flags imminent breaches so the orchestrator scales ahead of failure.

Reflection

What rebuilding it taught me

A dataset is an experiment design. The three arrival rates aren't arbitrary — they're knobs that let you study scaling behaviour at different time scales.
Autoscaling wants prediction, not reaction. A threshold alarm fires after you're already in trouble; a classifier can warn you before.
Random Forests are a sane default. Robust, little tuning, and they tell you which metrics mattered — perfect for a first systems-ML model.
Labels are the expensive part. Collecting telemetry is easy; deciding what "about to breach" means is the real research.

A stress test, recorded

From telemetry to decision

Edge Digital Twin

Time-series telemetry

Random Forest

SLA management

Autoscaling

AIML-as-a-Service

The experiment loop

Define the arrival rate

Scale & record

Label the states

Train the classifier

Predict & scale

What rebuilding it taught me