Create Python API for VideoEncoder #990

Dan-Flores · 2025-10-21T00:16:39Z

This PR creates a simple VideoEncoder class and updates several tests to use the VideoEncoder.to_file pattern.

test_bad_input_parameterized is the VideoEncoder equivalent of the AudioEncoder test that checks general error handling, while test_bad_input tests method-specific errors.
test_contiguity is the VideoEncoder equivalent of the AudioEncoder test that ensures both contiguous and non-contiguous tensors can be encoded, and that they are encoded equivalently.

NicolasHug · 2025-10-27T13:39:14Z

test/test_encoders.py

+            (torch.rand(num_frames, channels, height, width) * 255)
+            .to(torch.uint8)


Just use torch.randint(0, 256, size=(num_frames, channels, height, width), dtype=torch.uint8).contiguous()
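For comparison, here is a minimal sketch of the two ways of building random uint8 frames discussed in this thread. The tensor sizes are illustrative assumptions, not values taken from the test file.

```python
import torch

# Illustrative sizes only; the real test uses its own values.
num_frames, channels, height, width = 4, 3, 16, 16

# Pattern from the diff: floats in [0, 1) are scaled, then truncated to uint8.
scaled = (torch.rand(num_frames, channels, height, width) * 255).to(torch.uint8)

# Suggested pattern: draw integers directly. The upper bound (256) is exclusive,
# so every value from 0 to 255 can occur.
direct = torch.randint(
    0, 256, size=(num_frames, channels, height, width), dtype=torch.uint8
)

# A freshly created tensor is already contiguous, so .contiguous() returns it as-is.
print(direct.dtype, direct.is_contiguous())
```

Besides being shorter, the randint form avoids the float round trip and samples the full uint8 range directly.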

NicolasHug · 2025-10-27T13:41:08Z

test/test_encoders.py

+            contiguous_frames.permute(0, 3, 2, 1).contiguous().permute(0, 3, 2, 1)
+        )
+        assert non_contiguous_frames.stride() != contiguous_frames.stride()
+        assert not non_contiguous_frames.is_contiguous()


This is good, but you should be able to check for something stricter:

assert non_contiguous_frames.is_contiguous(memory_format=torch.channels_last)

Dan-Flores · 2025-10-27

So far, I have assumed frames will be in NCHW format. This shape is retained after the permutations, so I do not believe channels_last applies here?

NicolasHug · 2025-10-27 (edited)

The frames are always of NCHW shape. But internally they can have any arbitrary memory layout (format). These are mostly orthogonal concepts, i.e. frames can have NCHW shape while still being stored in NHWC layout (channels last).

I realize now the non_contiguous_frames you created are indeed not channels last. But we can force them to be by changing to:

contiguous_frames.permute(0, 2, 3, 1).contiguous().permute(0, 3, 1, 2)

and you should see that assert non_contiguous_frames.is_contiguous(memory_format=torch.channels_last) now passes. Note how we're now moving the channels dimension (marked *) from N*HW to NHW*, and back.

Your test was still correct BTW - I just think it's good to actually explicitly check for channels_last because this is how frames may arrive.
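The difference between the two permutation patterns can be checked directly. The following is a sketch with assumed tensor sizes (height and width are deliberately different so the strides are easy to tell apart); the variable names `original_pattern`, `suggested_pattern`, and `via_api` are hypothetical and do not come from the PR.

```python
import torch

# Assumed sizes for illustration: N=4, C=3, H=16, W=20.
contiguous_frames = torch.randint(0, 256, size=(4, 3, 16, 20), dtype=torch.uint8)

# Pattern from the diff: NCHW shape is kept, but memory is laid out as N, W, H, C.
original_pattern = (
    contiguous_frames.permute(0, 3, 2, 1).contiguous().permute(0, 3, 2, 1)
)

# Suggested pattern: NCHW shape is kept, memory is laid out as N, H, W, C.
suggested_pattern = (
    contiguous_frames.permute(0, 2, 3, 1).contiguous().permute(0, 3, 1, 2)
)

# PyTorch can also produce the channels-last layout without manual permutes.
via_api = contiguous_frames.contiguous(memory_format=torch.channels_last)

for name, t in [
    ("original", original_pattern),
    ("suggested", suggested_pattern),
    ("via_api", via_api),
]:
    print(
        name,
        tuple(t.shape),
        t.stride(),
        t.is_contiguous(),
        t.is_contiguous(memory_format=torch.channels_last),
    )
```

All three tensors hold the same values in the same NCHW shape. Only the suggested pattern and the `memory_format` variant satisfy the channels-last check, while the pattern from the diff is non-contiguous in both senses.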

meta-cla bot added the CLA Signed label Oct 21, 2025

Daniel Flores added 3 commits October 21, 2025 11:43

video encoder python file

1e06ea5

testing

1e7dc34

delete contiguous todo

cf7b75c

NicolasHug reviewed Oct 27, 2025

use randint suggestion, remove test skips

ee2285e

Dan-Flores force-pushed the vid_encoder_python branch from 3c6fd84 to ee2285e on October 27, 2025 17:31
