[PR-358] Paper Review - Training Differentially Private Generative Models with Sinkhorn Divergence (HyunKyu Jeon)
Super Tickets in Pre-Trained Language Models (HyunKyu Jeon)
This document discusses finding "super tickets" in pre-trained language models by pruning attention heads and feed-forward layers. It shows that lightly pruning BERT models can improve generalization without degrading accuracy (a phase-transition phenomenon). The authors propose a pruning approach for multi-task fine-tuning of language models called "ticket sharing," in which pruned weights are shared across tasks. Experiments on the GLUE benchmark show that the super ticket and ticket-sharing methods consistently outperform unpruned baselines, with larger gains on smaller tasks. Analysis indicates that pruning reduces model variance and that some tasks share more task-specific knowledge than others.
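As a rough illustration of the structured-pruning idea behind super tickets, here is a minimal NumPy sketch that keeps a binary mask over attention heads and drops the least-important fraction. The importance scores, prune ratio, and function name are illustrative; they are not the paper's actual importance criterion or pruning schedule.

```python
import numpy as np

def prune_attention_heads(head_importance, prune_ratio=0.2):
    """Build a binary keep/prune mask over attention heads by dropping the
    least-important fraction (illustrative criterion, not the paper's).

    head_importance: array of shape (num_layers, num_heads)
    returns: mask of the same shape, 1 = keep, 0 = prune
    """
    flat = head_importance.flatten()
    k = int(len(flat) * prune_ratio)          # number of heads to drop
    threshold = np.partition(flat, k)[k]      # k-th smallest importance value
    return (head_importance >= threshold).astype(np.float32)

# Toy usage: 12 layers x 12 heads with random importance scores.
rng = np.random.default_rng(0)
importance = rng.random((12, 12))
mask = prune_attention_heads(importance, prune_ratio=0.2)
print(f"kept {int(mask.sum())} of {mask.size} heads")
```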
Domain Invariant Representation Learning with Domain Density Transformations (HyunKyu Jeon)
The document discusses domain-invariant representation learning, which aims to build models that generalize well to unseen domains, and contrasts it with domain adaptation. It proposes a method that enforces invariance of representations under transformations between domains and uses generative adversarial networks to implement those transformations. The approach achieves competitive results against state-of-the-art domain generalization methods on several datasets.
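A hedged PyTorch sketch of the kind of invariance regularizer this implies: representations of an input and of its domain-translated version (produced by a pre-trained GAN-style generator) are pulled together. The generator interface, the MSE form of the penalty, and the stand-in modules in the usage example are assumptions for illustration, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def invariance_loss(encoder, generator, x, src_domain, tgt_domain):
    """Penalize the distance between representations of x and of its
    domain-translated counterpart G(x, src -> tgt), encouraging the encoder
    to be invariant to the transformation between domains (sketch only)."""
    with torch.no_grad():
        x_translated = generator(x, src_domain, tgt_domain)  # GAN-based domain mapping
    z = encoder(x)
    z_translated = encoder(x_translated)
    return F.mse_loss(z, z_translated)

# Toy check with stand-in modules (a real setup would use a CNN encoder and
# a StarGAN-like generator conditioned on domain labels):
enc = torch.nn.Linear(8, 4)
gen = lambda x, s, t: x + 0.01 * torch.randn_like(x)  # stand-in "translation"
x = torch.randn(16, 8)
print(invariance_loss(enc, gen, x, src_domain=0, tgt_domain=1))

# Total objective (illustrative weighting):
# loss = task_loss(classifier(enc(x)), y) + lam * invariance_loss(enc, gen, x, s, t)
```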
Meta Back-Translation (HyunKyu Jeon)
This document summarizes Meta Back-Translation, a method that improves back-translation by training the backward model to directly optimize the performance of the forward model during training. The key points are:
1. Back-translation typically relies on a fixed backward model, which can lead the forward model to overfit to its outputs. Meta back-translation instead continually trains the backward model to generate pseudo-parallel data that improves the forward model.
2. Experiments show that Meta Back-Translation produces fewer pathological outputs, such as translations whose length differs greatly from the references. It also avoids both overfitting and underfitting of the forward model by flexibly controlling the diversity of the pseudo-parallel data.
3. Related work also leverages monolingual data for machine translation, typically with a backward model that is trained separately and then held fixed.
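The core idea in the summary above can be written as a bi-level objective: choose the backward model's parameters so that, after the forward model takes a gradient step on the pseudo-parallel data the backward model generates, the forward model performs well on real parallel data. The notation below (parameters, step size, data symbols) is mine, a paraphrase of the summary rather than the paper's exact formulation.

```latex
% Bi-level view of meta back-translation (notation assumed):
%   \theta_f: forward-model parameters, \theta_b: backward-model parameters,
%   y: target-language monolingual sentence, (x', y'): real parallel pair, \eta: step size.
\min_{\theta_b} \; \mathbb{E}_{(x', y')}\Big[\ell\big(x', y';\, \theta_f'(\theta_b)\big)\Big]
\qquad \text{where} \qquad
\theta_f'(\theta_b) = \theta_f - \eta\, \nabla_{\theta_f}\, \ell\big(\hat{x}, y;\, \theta_f\big),
\qquad \hat{x} \sim p_{\theta_b}(\,\cdot \mid y\,).
```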
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning (HyunKyu Jeon)
This document summarizes the Maxmin Q-learning paper published at ICLR 2020. Maxmin Q-learning addresses the overestimation bias of Q-learning and the underestimation bias of Double Q-learning by maintaining multiple Q-functions and using the minimum value across them when constructing the target of the Q-learning update. Actions are selected by taking the maximum over actions of the per-action minimum Q-value, and at each step a random subset of the Q-functions is updated toward this maxmin target. This approach reduces the biases seen in prior methods.
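A minimal tabular sketch of the update described above; the hyperparameters, array shapes, and function name are illustrative choices, not values from the paper.

```python
import numpy as np

def maxmin_q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.99, n_update=1, rng=None):
    """One Maxmin Q-learning update on tabular Q-functions.

    Q: array of shape (N, num_states, num_actions) holding N Q-tables.
    The target takes the elementwise minimum over the N Q-tables, then the
    max over actions at s_next; a random subset of the tables is updated.
    """
    if rng is None:
        rng = np.random.default_rng()
    q_min = Q.min(axis=0)                        # min over the N Q-functions
    target = r + gamma * q_min[s_next].max()     # max_a of the min-Q at s'
    for i in rng.choice(len(Q), size=n_update, replace=False):
        Q[i, s, a] += alpha * (target - Q[i, s, a])
    return Q

# Toy usage: N=4 Q-tables, 10 states, 2 actions.
Q = np.zeros((4, 10, 2))
Q = maxmin_q_update(Q, s=0, a=1, r=1.0, s_next=3)

# Action selection is greedy w.r.t. the min-Q (plus exploration):
# a = argmax_a  min_i Q[i, s, a]
```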
Ten-Minute Math: Distance (십분수학: 거리) (HyunKyu Jeon)
Distance in data science and mathematics: Euclidean distance, Manhattan distance, Minkowski distance, and Mahalanobis distance.
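A small self-contained NumPy example of the four distances mentioned; the vectors, the choice of p = 3 for Minkowski, and the synthetic sample used to estimate the covariance for Mahalanobis are arbitrary.

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])
y = np.array([4.0, 0.0, 3.0])

euclidean = np.sqrt(np.sum((x - y) ** 2))           # L2 norm of the difference
manhattan = np.sum(np.abs(x - y))                   # L1 norm
minkowski = np.sum(np.abs(x - y) ** 3) ** (1 / 3)   # general Lp norm, here p = 3

# Mahalanobis distance needs an inverse covariance matrix estimated from data;
# the sample below is synthetic, just to keep the example self-contained.
data = np.random.default_rng(0).normal(size=(100, 3))
cov_inv = np.linalg.inv(np.cov(data, rowvar=False))
diff = x - y
mahalanobis = np.sqrt(diff @ cov_inv @ diff)

print(euclidean, manhattan, minkowski, mahalanobis)
```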