PyTorch nn.Linear

개념공부/기타

Zach Choi 2023. 2. 6. 22:05

728x90

Fully Connected Layer 구성 시 기본적으로 선택하는 식 중 하나

입력 데이터 x에 대해 Linear Transformation (선형 변환)을 계산한다.

$$ y = xA^{T}+b $$

class Torch.nn.Linear (in_features, out_features, bias=True, device=None, dtype=None)

Parameters:

Shape:

Variables: (학습되는 변수들)

weight : Linear Transformation의 A에 해당한다. Shape - (out_feature, in_feature). 가중치들은 입력 Feature의 개수를 역수 취한 값의 제곱근 값의 범위로 초기화 된다고 한다. (PyTorch 공식 문서) 왜지..?
bias : Linear Transformation의 b에 해당한다. Shape (out_features). 이 값도 weight와 동일한 범위로 초기화 된다.

728x90