x-transformers 2.11.15__tar.gz → 2.11.17__tar.gz
This diff shows the content changes between two publicly released versions of the package, as they appear in the public registry, and is provided for informational purposes only.
- {x_transformers-2.11.15 → x_transformers-2.11.17}/PKG-INFO +1 -1
- {x_transformers-2.11.15 → x_transformers-2.11.17}/pyproject.toml +1 -1
- {x_transformers-2.11.15 → x_transformers-2.11.17}/tests/test_x_transformers.py +6 -2
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/free_transformer.py +28 -4
- {x_transformers-2.11.15 → x_transformers-2.11.17}/.github/FUNDING.yml +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/.github/workflows/python-publish.yml +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/.github/workflows/python-test.yaml +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/.gitignore +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/LICENSE +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/README.md +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/data/README.md +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/data/enwik8.gz +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/all-attention.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/attention-on-attention.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/cosine-sim-attention.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/deepnorm.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/dynamic-pos-bias-linear.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/dynamic-pos-bias-log.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/dynamic-pos-bias-sinusoidal.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/dynamic-pos-bias.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/enhanced-recurrence.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/fcm.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/ffglu.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/flash-attention.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/gate_values.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/gating.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/length-extrapolation-scale.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/macaron-1.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/macaron-2.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/memory-transformer.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/normformer.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/pia.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/qknorm-analysis.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/resi_dual.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/residual_attn.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/rezero.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/rotary.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/sandwich-2.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/sandwich.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/sandwich_norm.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/scalenorm.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/talking-heads.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/topk-attention.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/images/xval.png +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/train_belief_state.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/train_copy.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/train_entropy_tokenizer.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/train_enwik8.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/train_free.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/train_gpt_vae.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/train_length_extrapolate.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/train_parity.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/train_with_muon.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/__init__.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/attend.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/autoregressive_wrapper.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/belief_state_wrapper.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/continuous.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/dpo.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/entropy_based_tokenizer.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/gpt_vae.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/multi_input.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/neo_mlp.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/nonautoregressive_wrapper.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/up_wrapper.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/x_transformers.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/xl_autoregressive_wrapper.py +0 -0
- {x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/xval.py +0 -0
{x_transformers-2.11.15 → x_transformers-2.11.17}/tests/test_x_transformers.py

@@ -1411,9 +1411,11 @@ def test_attn_negative_weights(
 
 @param('per_token_latents', (False, True))
 @param('dec_head_depth', (0, 4))
+@param('separate_seq_for_latents', (False, True))
 def test_free(
     dec_head_depth,
-    per_token_latents
+    per_token_latents,
+    separate_seq_for_latents
 ):
     from x_transformers.free_transformer import FreeTransformer
 
@@ -1432,7 +1434,9 @@ def test_free(
 
     seq = torch.randint(0, 256, (1, 1024))
 
-    loss, (ar_loss, aux_loss) = model(seq, return_all_losses = True)
+    separate_seq_for_latents = torch.randint(0, 256, (1, 32)) if separate_seq_for_latents else None
+
+    loss, (ar_loss, aux_loss) = model(seq, separate_seq_for_latents, return_all_losses = True)
     loss.backward()
 
     assert aux_loss.numel() == 1
{x_transformers-2.11.15 → x_transformers-2.11.17}/x_transformers/free_transformer.py

@@ -149,6 +149,7 @@ class FreeTransformer(Module):
         enc_kwargs: dict = dict(),
         dec_kwargs: dict = dict(),
         kl_loss_weight = 1.,
+        latent_dropout_prob = 0.,
         pad_id = -1,
         **kwargs
     ):
@@ -187,6 +188,8 @@ class FreeTransformer(Module):
 
         self.from_latent_to_condition = nn.Linear(self.binary_mapper.num_codes, dim, bias = False)
 
+        self.latent_dropout = nn.Dropout(latent_dropout_prob)
+
         self.decoder_head = Decoder(
             dim = dim,
             depth = dec_head_depth,
@@ -225,8 +228,11 @@
         self,
         decoder_head_embeds,
         mask = None,
-        return_kl_loss = False
+        return_kl_loss = False,
+        per_token_latents = None
     ):
+        per_token_latents = default(per_token_latents, self.per_token_latents)
+
         batch, seq_len, device = *decoder_head_embeds.shape[:2], decoder_head_embeds.device
 
         query_tokens = repeat(self.query_token_for_latents, 'd -> b 1 d', b = batch)
@@ -235,7 +241,7 @@
 
         # handle the interesting per query token latents, as in the paper
 
-        if self.per_token_latents:
+        if per_token_latents:
             query_tokens = repeat(query_tokens, 'b 1 d -> b n d', n = seq_len)
 
             rotary_pos = torch.arange(seq_len, device = device)
@@ -342,13 +348,13 @@
     def forward(
         self,
         seq,
+        seq_for_latents = None,
         return_all_losses = False
     ):
         batch, device = seq.shape[0], seq.device
 
         seq, labels = seq[:, :-1], seq[:, 1:]
 
-        encoder_mask = seq != self.pad_id
 
         tokens = self.token_emb(seq)
 
@@ -357,9 +363,27 @@
         if exists(self.decoder_head):
             tokens = self.decoder_head(tokens)
 
+        # determine whether to use a separate sequence for encoding latents
+
+        if exists(seq_for_latents):
+            tokens_for_latents = self.token_emb(seq_for_latents)
+
+            if exists(self.decoder_head):
+                tokens_for_latents = self.decoder_head(tokens_for_latents)
+
+            encoder_mask = seq_for_latents != self.pad_id
+            per_token_latents = False
+        else:
+
+            tokens_for_latents = tokens
+            encoder_mask = seq != self.pad_id
+            per_token_latents = None
+
         # get latent Z
 
-        latents, kl_loss = self.encode_to_latents(tokens, mask = encoder_mask, return_kl_loss = True)
+        latents, kl_loss = self.encode_to_latents(tokens_for_latents, mask = encoder_mask, per_token_latents = per_token_latents, return_kl_loss = True)
+
+        latents = self.latent_dropout(latents)
 
         condition = self.from_latent_to_condition(latents)
 
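Taken together, 2.11.17 lets a FreeTransformer encode its latent Z from a separate sequence (the new seq_for_latents argument to forward, which also disables per-token latents for that encoding pass) and optionally apply dropout to the latents before they condition the decoder (the new latent_dropout_prob constructor argument). A minimal sketch of the new usage, mirroring the updated test_free above — the constructor arguments other than latent_dropout_prob (num_tokens, dim, enc_depth, dec_depth) are illustrative assumptions, not taken from this diff:

import torch
from x_transformers.free_transformer import FreeTransformer

# num_tokens / dim / enc_depth / dec_depth are assumed for illustration;
# only latent_dropout_prob is introduced in this release
model = FreeTransformer(
    num_tokens = 256,
    dim = 64,
    enc_depth = 2,
    dec_depth = 2,
    latent_dropout_prob = 0.1   # new: nn.Dropout applied to latents before conditioning
)

seq = torch.randint(0, 256, (1, 1024))

# new: encode the latents from a separate (here much shorter) sequence,
# exactly as exercised in the updated test_free
seq_for_latents = torch.randint(0, 256, (1, 32))

loss, (ar_loss, aux_loss) = model(seq, seq_for_latents, return_all_losses = True)
loss.backward()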