MSELoss¶
-
class
torch.nn.MSELoss(size_average=None, reduce=None, reduction: str = 'mean')[source]¶ Creates a criterion that measures the mean squared error (squared L2 norm) between each element in the input and target .
The unreduced (i.e. with
reductionset to'none') loss can be described as:where is the batch size. If
reductionis not'none'(default'mean'), then:and are tensors of arbitrary shapes with a total of elements each.
The mean operation still operates over all the elements, and divides by .
The division by can be avoided if one sets
reduction = 'sum'.- Parameters
size_average (bool, optional) – Deprecated (see
reduction). By default, the losses are averaged over each loss element in the batch. Note that for some losses, there are multiple elements per sample. If the fieldsize_averageis set toFalse, the losses are instead summed for each minibatch. Ignored when reduce isFalse. Default:Truereduce (bool, optional) – Deprecated (see
reduction). By default, the losses are averaged or summed over observations for each minibatch depending onsize_average. WhenreduceisFalse, returns a loss per batch element instead and ignoressize_average. Default:Truereduction (string, optional) – Specifies the reduction to apply to the output:
'none'|'mean'|'sum'.'none': no reduction will be applied,'mean': the sum of the output will be divided by the number of elements in the output,'sum': the output will be summed. Note:size_averageandreduceare in the process of being deprecated, and in the meantime, specifying either of those two args will overridereduction. Default:'mean'
- Shape:
Input: where means, any number of additional dimensions
Target: , same shape as the input
Examples:
>>> loss = nn.MSELoss() >>> input = torch.randn(3, 5, requires_grad=True) >>> target = torch.randn(3, 5) >>> output = loss(input, target) >>> output.backward()