Tracks

Tracks serve as an abstraction layer in Diffusion Studio, providing two key functionalities:

  1. Efficient Clip Rendering: Tracks determine which clips will be rendered, allowing the renderer to skip clips that are not currently visible; only the visible clips within a track are processed.

  2. Layering Control: Tracks define the layering order of visible elements. The Track at index 0 will be rendered last, meaning it appears on top of other tracks.

Track Types and Creation

Similar to clips, tracks are typed. For instance, you can create a TextTrack, which is designed to hold only TextClips or classes derived from TextClip (e.g. ComplexTextClip):

import { TextTrack } from '@diffusionstudio/core';
 
const track = new TextTrack();

You can now add clips to the track, but it's important to note that clips on the same track cannot overlap.

await track.add(
  new TextClip({
    text: 'Hello',
    start: 0, // Default start time
    stop: 30, 
  })
);
 
await track.add(
  new TextClip({
    text: 'World',
    start: 31, 
    stop: 60,
  })
);

If clips overlap, Diffusion Studio will automatically adjust the start/stop times or move the clip to a different track. The add method is asynchronous because Diffusion Studio waits until the clip is fully loaded before adding it to the track.
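As a rough sketch of this behavior (the exact resolution, re-timing versus moving to another track, is decided by the library), adding a clip that overlaps the 'Hello' clip above might look like this:

await track.add(
  new TextClip({
    text: 'Overlapping',
    start: 20, // overlaps the 'Hello' clip (0–30)
    stop: 45,
  })
);
// Once add resolves, the clip has been loaded and either re-timed or
// placed on a different track, so no two clips on one track overlap.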

You can render a track by adding it to a composition:

import { Composition } from '@diffusionstudio/core';
 
const composition = new Composition();
 
composition.shiftTrack(track);

You can remove a clip from the track by calling track.remove(clip):
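
// A minimal sketch reusing the TextTrack from above; the times are
// chosen so the clip does not overlap the existing clips.
const temp = new TextClip({ text: 'Temporary', start: 61, stop: 90 });
 
await track.add(temp); // attach the clip to the track
track.remove(temp);    // detach it again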

Stack Mode

Tracks can also be configured to operate in stack mode, which ensures that each new clip added starts exactly where the previous clip ended (plus 1 millisecond). Stack mode can be enabled as follows:

track.stacked(); // disable with track.stacked(false);
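
A sketch of that behavior on a fresh track (assuming the given start/stop values are re-timed by the track when stacked):

const stackedTrack = new TextTrack();
stackedTrack.stacked();
 
await stackedTrack.add(new TextClip({ text: 'First', stop: 30 }));  // plays from 0 to 30
await stackedTrack.add(new TextClip({ text: 'Second', stop: 30 })); // re-timed to start at 31
 
// The second clip's start is adjusted automatically, so there is no
// need to compute start times by hand.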

Layering

Let's set up an example to illustrate how tracks are layered:

// alternative approach for creating tracks
// and adding them to the composition
const video = composition.createTrack('video'); 
const image = composition.createTrack('image');
const text = composition.createTrack('text');

When a track is added to the composition, it is inserted at position 0 and rendered last. Our composition.tracks array now looks like this: [TextTrack, ImageTrack, VideoTrack]. When visualized, the text is rendered on top of the images, which in turn are rendered on top of all VideoClips inside the VideoTrack.

Each track implements the layer method, which can be used to change the order. It accepts an index or 'top' | 'bottom':

video.layer('top');

Now VideoTrack will be rendered on top of all other tracks.
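
To move it back, you can use 'bottom' or a numeric index; the index is assumed here to mirror the position in composition.tracks, where index 0 is rendered last (on top):

video.layer('bottom'); // move it back below the image and text tracks
 
// or by index – assuming 0 maps to the top-most position:
text.layer(0); // ensure the text track is rendered on top again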

Managing Multiple Clips

Tracks offer several useful methods for managing multiple clips simultaneously. For example, the offsetBy method shifts all clips within the track by a specified amount of time:

track.offsetBy(40);

Another powerful method is apply, which applies a function to each clip in the track. For example, you can offset all clips by 40 milliseconds:

track.apply(clip => clip.offsetBy(40)); // will trigger more events compared to offsetBy

Caption Track

The CaptionTrack class provides specialized features for managing captions of audio or video clips.

To begin, you need an AudioClip or VideoClip with an assigned transcript. For demonstration, we'll use a MediaClip, which is a base class for these types:

import { Transcript, MediaClip } from '@diffusionstudio/core';
 
const transcript = new Transcript(); // Assume this is populated with data
 
const clip = new MediaClip({ transcript });

You can then create captions from the MediaClip using a CaptionTrack:

await composition
  .createTrack('caption') // creates CaptionTrack
  .from(clip)             // sets the source
  .generate();            // generates text clips from the source

The generate method supports various caption presets.

One advantage of linking the CaptionTrack to a MediaClip is that captions will automatically adjust when you move the clip using offsetBy.
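
For example (a sketch, assuming the same millisecond-based offsetBy used earlier):

// Shifting the media clip also shifts the captions that were
// generated from it, so they stay in sync automatically.
clip.offsetBy(500);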
