torchelie.transforms

class torchelie.transforms.ResizeNoCrop(size, mode=2)

Resize a PIL image so that its longer side has length size.

Parameters

size (int) – max size of the image
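As a rough illustration of the sizing rule above, the sketch below computes the output dimensions when the longer side is brought to size. It assumes the aspect ratio is preserved and that dimensions are rounded to the nearest pixel; neither detail is stated here, and resize_no_crop_dims is a hypothetical helper, not part of torchelie.

```python
from typing import Tuple


def resize_no_crop_dims(width: int, height: int, size: int) -> Tuple[int, int]:
    """Return (width, height) scaled so the longer side equals `size`.

    Assumption: the aspect ratio is kept and results are rounded to the
    nearest pixel. The real transform may round differently.
    """
    ratio = size / max(width, height)
    return max(1, round(width * ratio)), max(1, round(height * ratio))
```

Under these assumptions, a 640x480 image with size=320 comes out as 320x240.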

class torchelie.transforms.ResizedCrop(size, scale=1, ratio=1, interpolation=torchvision.transforms.InterpolationMode.BILINEAR)

Crop the given PIL Image, then resize it. A crop whose size is a fraction of the original image size (controlled by scale) is made. This crop is then resized to the given size.

Parameters
  • size – expected output size of each edge

  • scale – zoom factor

  • interpolation – interpolation mode. Default: torchvision.transforms.InterpolationMode.BILINEAR

static get_params(img, scale, ratio=1)

Get parameters for crop.

Parameters
  • img (PIL Image) – image to be cropped

  • scale (float) – size of the crop, relative to the original size

Returns

params (i, j, h, w) to be passed to crop.

Return type

tuple

class torchelie.transforms.AdaptPad(sz, padding_mode='constant', fill=0)

Pad an input image so that it reaches size sz.

Parameters
  • sz ((int, int)) – target size

  • padding_mode (str) – one of the padding modes accepted by torchvision.transforms.functional.pad
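To make the padding arithmetic concrete, here is a minimal sketch that computes how many pixels to add on each side to reach a target size. It assumes the padding is split as evenly as possible between opposite sides and that an image already larger than the target is left unpadded; the text above does not state either point, nor whether sz is ordered (width, height) or (height, width), so the hypothetical helper centered_padding takes both explicitly.

```python
from typing import Tuple


def centered_padding(width: int, height: int,
                     target_w: int, target_h: int) -> Tuple[int, int, int, int]:
    """Return (left, top, right, bottom) padding to reach the target size.

    Assumption: padding is centered, with any odd leftover pixel going
    to the right/bottom side. Dimensions already at or above the target
    receive no padding.
    """
    pad_w = max(0, target_w - width)
    pad_h = max(0, target_h - height)
    left = pad_w // 2
    top = pad_h // 2
    return left, top, pad_w - left, pad_h - top
```

The (left, top, right, bottom) order matches the 4-tuple form that torchvision's pad function accepts.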

class torchelie.transforms.MultiBranch(transforms)

Transform an image with multiple transforms

Parameters

transforms (list of transforms) – the parallel set of transforms
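The idea of running parallel transforms on one input can be sketched as follows. MultiBranchSketch is a hypothetical stand-in written for illustration; in particular, returning a tuple is an assumption, since the container type returned by the real class is not stated here.

```python
from typing import Any, Callable, Sequence, Tuple


class MultiBranchSketch:
    """Apply every transform to the same input and collect the results."""

    def __init__(self, transforms: Sequence[Callable[[Any], Any]]):
        self.transforms = list(transforms)

    def __call__(self, x: Any) -> Tuple[Any, ...]:
        # Each branch sees the original input, not another branch's output.
        return tuple(t(x) for t in self.transforms)


# Toy branches operating on a number instead of an image.
branches = MultiBranchSketch([lambda v: v + 1, lambda v: v * 2, str])
outputs = branches(10)
```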

class torchelie.transforms.Canny(thresh_low: int = 100, thresh_high: int = 200)

Run the Canny edge detector over an image. Requires OpenCV to be installed.

Parameters
  • thresh_low (int) – lower threshold (default: 100)

  • thresh_high (int) – upper threshold (default: 200)

class torchelie.transforms.RandAugment(n_transforms: int, magnitude: float, interpolation: torchvision.transforms.InterpolationMode = torchvision.transforms.InterpolationMode.BILINEAR, fill: Optional[List[float]] = None)

RandAugment policy from "RandAugment: Practical automated data augmentation with a reduced search space".

Parameters
  • n_transforms (int) – how many transforms to apply

  • magnitude (float) – magnitude of the transforms. 10 is the base rate; higher values are allowed.

  • interpolation – interpolation to use for suitable transforms

  • fill – fill value to use for suitable transforms

berserk_mode() → torchelie.transforms.randaugment.RandAugment

Load even more transforms

forward(img: PIL.Image.Image) → PIL.Image.Image

Parameters

img (PIL Image or Tensor) – image to be transformed.

Returns

RandAugmented image.

Return type

PIL Image or Tensor
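The general shape of a RandAugment policy can be sketched as below: pick n_transforms operations uniformly at random, with replacement, and apply them in sequence at a shared magnitude. This is a generic illustration of the policy, not torchelie's code; the pool of operations and the way each one interprets magnitude are not described here, so the toy numeric op in the example is hypothetical.

```python
import random
from typing import Any, Callable, Sequence

Op = Callable[[Any, float], Any]


def rand_augment_sketch(x: Any, ops: Sequence[Op], n_transforms: int,
                        magnitude: float, rng=random) -> Any:
    """Apply n_transforms randomly chosen ops to x, one after another."""
    # Ops are sampled uniformly and with replacement.
    for op in rng.choices(list(ops), k=n_transforms):
        x = op(x, magnitude)
    return x


# Toy pool with a single op so the outcome does not depend on the draw.
toy_ops = [lambda value, m: value + m]
result = rand_augment_sketch(0.0, toy_ops, n_transforms=3, magnitude=10.0)
```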

class torchelie.transforms.Posterize(min_bits: int = 4, max_bits: int = 8)

Apply a Posterize filter with a random number of bits.

Parameters
  • min_bits (int) – minimum color encoding bits

  • max_bits (int) – maximum color encoding bits
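For readers unfamiliar with posterization, the sketch below shows the usual definition on 8-bit channel values: only the most significant bits are kept. Drawing the bit count uniformly from the inclusive range [min_bits, max_bits] is an assumption about how the random choice is made, and both helpers are hypothetical.

```python
import random
from typing import List, Tuple


def posterize_value(value: int, bits: int) -> int:
    """Keep the `bits` most significant bits of an 8-bit value."""
    mask = 0xFF & ~((1 << (8 - bits)) - 1)
    return value & mask


def random_posterize(pixels: List[int], min_bits: int = 4, max_bits: int = 8,
                     rng=random) -> Tuple[List[int], int]:
    """Posterize all pixels with one randomly drawn bit count."""
    # Assumption: the bit count is uniform over [min_bits, max_bits], inclusive.
    bits = rng.randint(min_bits, max_bits)
    return [posterize_value(p, bits) for p in pixels], bits
```

With bits=8 the values are unchanged, and fewer bits give coarser color steps.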

class torchelie.transforms.Solarize(max_thresh: int = 128)

Apply a Solarize filter with a random threshold.

Parameters

max_thresh (int) – upper bound for the random threshold.
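Solarization inverts the bright part of an image. A minimal sketch on 8-bit values follows; it assumes values at or above the threshold are inverted and that the threshold is drawn uniformly from [0, max_thresh], neither of which is spelled out above. The helper names are hypothetical.

```python
import random
from typing import List, Tuple


def solarize_value(value: int, threshold: int) -> int:
    """Invert an 8-bit value when it is at or above `threshold`."""
    return 255 - value if value >= threshold else value


def random_solarize(pixels: List[int], max_thresh: int = 128,
                    rng=random) -> Tuple[List[int], int]:
    """Solarize all pixels with one randomly drawn threshold."""
    # Assumption: the threshold is uniform over [0, max_thresh], inclusive.
    threshold = rng.randint(0, max_thresh)
    return [solarize_value(p, threshold) for p in pixels], threshold
```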

class torchelie.transforms.Cutout(min_size: float, max_size: float)

Applies a random Cutout filter erasing at most max_size * 100% of the picture.

Parameters

max_size (float) – the maximum ratio that can be erased. 0 means no erasure, 1 means up to the whole image can be erased.

class torchelie.transforms.Identity

Do nothing

class torchelie.transforms.Subsample(min_ratio: int = 1, max_ratio: int = 3, p: float = 0.5, interpolation: torchvision.transforms.InterpolationMode = torchvision.transforms.InterpolationMode.BILINEAR)

Randomly subsample images.

Parameters
  • p (float) – the transform is applied with probability p

  • max_ratio (int) – maximum subscaling factor

  • interpolation (InterpolationMode) – interpolation mode

class torchelie.transforms.JPEGArtifacts(min_compression: float = 0.5, p: float = 0.5, **jpeg_args)

Add random JPEG compression artifacts.

Parameters
  • p (float) – probability of applying the filter

  • min_compression (float) – minimum quality (1: maximum quality)

torchelie.transforms.differentiable

class torchelie.transforms.differentiable.AllAtOnceColor(B: int, init: Optional[torch.Tensor] = None)

Similar to AllAtOnceGeometric, performs multiple color transforms at once.

Parameters
  • B (int) – batch size

  • init (torch.Tensor) – an initial user-supplied transformation matrix. If not provided, defaults to identity.

apply(x: torch.Tensor) → torch.Tensor

Applies transforms on x

Parameters

x (torch.Tensor) – input

Returns

transformed x

brightness(alpha: float, prob: float = 1.0) → torchelie.transforms.differentiable.AllAtOnceColor

Change brightness by a factor alpha

Parameters

alpha (float) – scale factor

contrast(alpha: float, prob: float = 1.0) → torchelie.transforms.differentiable.AllAtOnceColor

Scale contrast by factor alpha

Parameters

alpha (float) – scale factor

class torchelie.transforms.differentiable.AllAtOnceGeometric(B: int, init: Optional[torch.Tensor] = None)

Various geometric transforms packed up into an affine transformation matrix. Transformations can be stacked, then applied to a 4D tensor in one pass to reduce artifacts, memory usage, and compute. Fully differentiable.

>>> import torch
>>> from torchelie.transforms.differentiable import AllAtOnceGeometric
>>> img = torch.randn(10, 3, 32, 32)
>>> transformed = AllAtOnceGeometric(10).translate(5, 5).scale(0.9, 0.9).apply(img)

Note: the transformations are sampled at creation, so each call to apply() runs the same transforms. Construct another AllAtOnceGeometric object to get another set of transforms. This makes it easy to run the same transforms on paired datasets.

Note 2: each transform has a prob argument giving the probability of applying the transform rather than bypassing it. This makes it easy to implement StyleGAN2-ADA.

Parameters
  • B (int) – batch size

  • init (torch.Tensor) – an initial user-supplied transformation matrix. If not provided, defaults to identity.

rotate(theta: float, prob: float = 1.0) → torchelie.transforms.differentiable.AllAtOnceGeometric

Rotate the image by an angle randomly sampled in [-theta, theta].

Parameters

theta (float) – an angle in degrees

scale(x: float, y: float, prob: float = 1.0) → torchelie.transforms.differentiable.AllAtOnceGeometric

Randomly scale the image horizontally by a factor sampled in [1 - x, 1 + x] and vertically by a factor sampled in [1 - y, 1 + y].

Parameters
  • x (float) – horizontal factor

  • y (float) – vertical factor

translate(x: float, y: float, prob: float = 1.0) → torchelie.transforms.differentiable.AllAtOnceGeometric

Randomly translate the image horizontally by an offset sampled in [-x, x] and vertically by an offset sampled in [-y, y]. Note that the coordinates are not pixel coordinates but texel coordinates in [-1, 1].
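The reason stacking works for AllAtOnceGeometric is that affine transforms compose by matrix multiplication, so several of them collapse into one matrix and the image only needs to be resampled once. The pure-Python sketch below shows this with 3x3 homogeneous matrices acting on a single point. It is generic affine algebra, not torchelie's code: the helper names are hypothetical, and the composition order and sign conventions the library uses are not stated here.

```python
import math
from typing import List, Tuple

Matrix = List[List[float]]


def matmul3(a: Matrix, b: Matrix) -> Matrix:
    """3x3 matrix product a @ b."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def rotation(theta_deg: float) -> Matrix:
    t = math.radians(theta_deg)
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def scaling(sx: float, sy: float) -> Matrix:
    return [[sx, 0.0, 0.0], [0.0, sy, 0.0], [0.0, 0.0, 1.0]]


def translation(tx: float, ty: float) -> Matrix:
    return [[1.0, 0.0, tx], [0.0, 1.0, ty], [0.0, 0.0, 1.0]]


def apply_point(m: Matrix, x: float, y: float) -> Tuple[float, float]:
    """Map the point (x, y) through the affine matrix m."""
    return (m[0][0] * x + m[0][1] * y + m[0][2],
            m[1][0] * x + m[1][1] * y + m[1][2])


# Stack translate -> scale -> rotate into one matrix (rightmost applies first).
stacked = matmul3(rotation(90.0),
                  matmul3(scaling(0.5, 0.5), translation(0.2, 0.0)))

# Applying the stacked matrix once matches applying the three steps in turn.
step = apply_point(translation(0.2, 0.0), 1.0, 0.0)
step = apply_point(scaling(0.5, 0.5), *step)
step = apply_point(rotation(90.0), *step)
once = apply_point(stacked, 1.0, 0.0)
```

Because the coordinates are normalized to [-1, 1], the offsets in this sketch are small fractions rather than pixel counts.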

torchelie.transforms.differentiable.center_crop(batch, size)

Crop the center of a 4D image tensor

Parameters
  • batch (4D images tensor) – the tensor to crop

  • size ((int, int)) – size of the resulting image as (height, width)

Returns

The cropped image
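The index arithmetic behind a center crop can be sketched on a plain 2D grid, as below. The real function operates on a 4D tensor; this hypothetical helper handles only the two spatial axes, and rounding the offsets down when the margin is odd is an assumption.

```python
from typing import List, Tuple


def center_crop_2d(img: List[List[float]],
                   size: Tuple[int, int]) -> List[List[float]]:
    """Crop the central (height, width) window of a 2D grid."""
    crop_h, crop_w = size
    # Assumption: odd margins are rounded down on the top/left side.
    top = (len(img) - crop_h) // 2
    left = (len(img[0]) - crop_w) // 2
    return [row[left:left + crop_w] for row in img[top:top + crop_h]]


# 4x4 grid holding the values 0..15 in row-major order.
grid = [[r * 4 + c for c in range(4)] for r in range(4)]
center = center_crop_2d(grid, (2, 2))
```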

torchelie.transforms.differentiable.crop(img, warped=True, sub_img_factor=2)

Randomly crop a part of img that is sub_img_factor times smaller along each side.

Parameters
  • img (3D or 4D image(s) tensor) – input image(s)

  • warped (bool) – Whether the image should be considered warped (default: True)

  • sub_img_factor (float) – factor by which each side of the image is divided. For instance, 2 will crop a quarter of the image (half the width, half the height). (default: 2)
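The relation between sub_img_factor and the crop size can be sketched as follows. The hypothetical helper random_crop_box only computes a crop box; it does not model the warped option, and truncating the crop size to an integer and drawing the offset uniformly are both assumptions.

```python
import random
from typing import Tuple


def random_crop_box(height: int, width: int, sub_img_factor: float = 2,
                    rng=random) -> Tuple[int, int, int, int]:
    """Return (top, left, crop_h, crop_w) for a random sub-image."""
    # Each side shrinks by sub_img_factor, so the area shrinks by its square.
    crop_h = int(height / sub_img_factor)
    crop_w = int(width / sub_img_factor)
    # Assumption: the top-left corner is uniform over all valid positions.
    top = rng.randint(0, height - crop_h)
    left = rng.randint(0, width - crop_w)
    return top, left, crop_h, crop_w
```

For a 32x32 image and sub_img_factor=2, the box is always 16x16, a quarter of the original area.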

torchelie.transforms.differentiable.gblur(input)

Gaussian blur with kernel size 3

Parameters

input (3D or 4D image(s) tensor) – input image

Returns

the blurred tensor

torchelie.transforms.differentiable.mblur(input)

Mean (or average) blur with kernel size 3

Parameters

input (3D or 4D image(s) tensor) – input image

Returns

the blurred tensor
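A mean blur with kernel size 3 replaces each pixel by the average of its 3x3 neighbourhood. The sketch below does this on a plain 2D grid. How the real function treats borders is not stated here, so this hypothetical helper simply leaves border pixels unchanged.

```python
from typing import List


def mean_blur_3x3(img: List[List[float]]) -> List[List[float]]:
    """Average each interior pixel with its eight neighbours."""
    h, w = len(img), len(img[0])
    out = [list(row) for row in img]
    # Assumption: border pixels are copied as-is (no padding is applied).
    for r in range(1, h - 1):
        for c in range(1, w - 1):
            window = [img[r + dr][c + dc]
                      for dr in (-1, 0, 1) for dc in (-1, 0, 1)]
            out[r][c] = sum(window) / 9.0
    return out
```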

torchelie.transforms.differentiable.roll(img, x_roll, y_roll)

Roll an image along its x and y axes, wrapping around at the borders

Parameters
  • img (3D or 4D image(s) tensor) – an image tensor

  • x_roll (int) – how many pixels to roll on the x axis

  • y_roll (int) – how many pixels to roll on the y axis

Returns

The rolled tensor
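Rolling shifts pixels along an axis and reinserts those pushed past one edge at the opposite edge. A minimal sketch on a 2D grid follows; the hypothetical helper roll_2d assumes that a positive x_roll moves content to the right and a positive y_roll moves it down, which is the convention torch.roll uses but is not confirmed here for this function.

```python
from typing import List


def roll_2d(img: List[List[int]], x_roll: int, y_roll: int) -> List[List[int]]:
    """Shift a 2D grid by (x_roll, y_roll) with wrap-around."""
    h, w = len(img), len(img[0])
    # The output pixel at (r, c) comes from the input pixel that was
    # y_roll rows above and x_roll columns to the left, modulo the size.
    return [[img[(r - y_roll) % h][(c - x_roll) % w] for c in range(w)]
            for r in range(h)]


small = [[1, 2, 3],
         [4, 5, 6]]
rolled_x = roll_2d(small, 1, 0)
rolled_y = roll_2d(small, 0, 1)
```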