x-transformers 2.11.17.tar.gz → 2.11.18.tar.gz
This diff compares the contents of two publicly released versions of the package as they appear in their public registry. It is provided for informational purposes only.
Potentially problematic release: this version of x-transformers has been flagged as potentially problematic by the registry's automated checks.
- {x_transformers-2.11.17 → x_transformers-2.11.18}/PKG-INFO +11 -1
- {x_transformers-2.11.17 → x_transformers-2.11.18}/README.md +10 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/pyproject.toml +1 -1
- {x_transformers-2.11.17 → x_transformers-2.11.18}/tests/test_x_transformers.py +11 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/x_transformers.py +9 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/.github/FUNDING.yml +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/.github/workflows/python-publish.yml +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/.github/workflows/python-test.yaml +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/.gitignore +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/LICENSE +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/data/README.md +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/data/enwik8.gz +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/all-attention.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/attention-on-attention.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/cosine-sim-attention.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/deepnorm.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/dynamic-pos-bias-linear.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/dynamic-pos-bias-log.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/dynamic-pos-bias-sinusoidal.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/dynamic-pos-bias.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/enhanced-recurrence.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/fcm.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/ffglu.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/flash-attention.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/gate_values.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/gating.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/length-extrapolation-scale.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/macaron-1.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/macaron-2.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/memory-transformer.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/normformer.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/pia.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/qknorm-analysis.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/resi_dual.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/residual_attn.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/rezero.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/rotary.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/sandwich-2.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/sandwich.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/sandwich_norm.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/scalenorm.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/talking-heads.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/topk-attention.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/images/xval.png +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/train_belief_state.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/train_copy.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/train_entropy_tokenizer.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/train_enwik8.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/train_free.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/train_gpt_vae.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/train_length_extrapolate.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/train_parity.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/train_with_muon.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/__init__.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/attend.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/autoregressive_wrapper.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/belief_state_wrapper.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/continuous.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/dpo.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/entropy_based_tokenizer.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/free_transformer.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/gpt_vae.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/multi_input.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/neo_mlp.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/nonautoregressive_wrapper.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/up_wrapper.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/xl_autoregressive_wrapper.py +0 -0
- {x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/xval.py +0 -0
{x_transformers-2.11.17 → x_transformers-2.11.18}/PKG-INFO +11 -1

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: x-transformers
-Version: 2.11.17
+Version: 2.11.18
 Summary: X-Transformers
 Project-URL: Homepage, https://pypi.org/project/x-transformers/
 Project-URL: Repository, https://github.com/lucidrains/x-transformers

@@ -2607,4 +2607,14 @@ ids_out, num_out, is_number_mask = model.generate(start_ids, start_nums, 17)
 }
 ```
 
+```bibtex
+@article{elhage2022solu,
+    title   = {Softmax Linear Units},
+    author  = {Elhage, Nelson and Hume, Tristan and Olsson, Catherine and Nanda, Neel and Henighan, Tom and Johnston, Scott and ElShowk, Sheer and Joseph, Nicholas and DasSarma, Nova and Mann, Ben and Hernandez, Danny and Askell, Amanda and Ndousse, Kamal and Jones, Andy and Drain, Dawn and Chen, Anna and Bai, Yuntao and Ganguli, Deep and Lovitt, Liane and Hatfield-Dodds, Zac and Kernion, Jackson and Conerly, Tom and Kravec, Shauna and Fort, Stanislav and Kadavath, Saurav and Jacobson, Josh and Tran-Johnson, Eli and Kaplan, Jared and Clark, Jack and Brown, Tom and McCandlish, Sam and Amodei, Dario and Olah, Christopher},
+    year    = {2022},
+    journal = {Transformer Circuits Thread},
+    note    = {https://transformer-circuits.pub/2022/solu/index.html}
+}
+```
+
 *solve intelligence... then use that to solve everything else.* - Demis Hassabis
{x_transformers-2.11.17 → x_transformers-2.11.18}/README.md +10 -0

@@ -2558,4 +2558,14 @@ ids_out, num_out, is_number_mask = model.generate(start_ids, start_nums, 17)
 }
 ```
 
+```bibtex
+@article{elhage2022solu,
+    title   = {Softmax Linear Units},
+    author  = {Elhage, Nelson and Hume, Tristan and Olsson, Catherine and Nanda, Neel and Henighan, Tom and Johnston, Scott and ElShowk, Sheer and Joseph, Nicholas and DasSarma, Nova and Mann, Ben and Hernandez, Danny and Askell, Amanda and Ndousse, Kamal and Jones, Andy and Drain, Dawn and Chen, Anna and Bai, Yuntao and Ganguli, Deep and Lovitt, Liane and Hatfield-Dodds, Zac and Kernion, Jackson and Conerly, Tom and Kravec, Shauna and Fort, Stanislav and Kadavath, Saurav and Jacobson, Josh and Tran-Johnson, Eli and Kaplan, Jared and Clark, Jack and Brown, Tom and McCandlish, Sam and Amodei, Dario and Olah, Christopher},
+    year    = {2022},
+    journal = {Transformer Circuits Thread},
+    note    = {https://transformer-circuits.pub/2022/solu/index.html}
+}
+```
+
 *solve intelligence... then use that to solve everything else.* - Demis Hassabis
{x_transformers-2.11.17 → x_transformers-2.11.18}/tests/test_x_transformers.py +11 -0

@@ -1462,3 +1462,14 @@ def test_kv_input_residual():
     out = attn(tokens, context = context, cross_attn_kv_residuals = condition)
 
     assert tokens.shape == out.shape
+
+def test_solu():
+    attn = Decoder(
+        dim = 256,
+        depth = 2,
+        heads = 4,
+        ff_solu = True
+    )
+
+    tokens = torch.randn(3, 32, 256)
+    attn(tokens)
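For anyone wanting to try the new option end to end, the test above exercises it directly on a `Decoder`. Below is a hedged usage sketch, assuming x-transformers 2.11.18 is installed; `TransformerWrapper`, `num_tokens`, and `max_seq_len` are the library's usual entry points and are not part of this diff, while `ff_solu = True` mirrors the new `test_solu` test.

```python
import torch
from x_transformers import TransformerWrapper, Decoder

# Sketch only: the ff_solu flag is taken from the new test in this release;
# the surrounding TransformerWrapper setup is the library's standard usage.
model = TransformerWrapper(
    num_tokens = 256,
    max_seq_len = 1024,
    attn_layers = Decoder(
        dim = 256,
        depth = 2,
        heads = 4,
        ff_solu = True  # select the SoLU feedforward activation added in 2.11.18
    )
)

tokens = torch.randint(0, 256, (1, 128))
logits = model(tokens)  # (1, 128, 256)
```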
{x_transformers-2.11.17 → x_transformers-2.11.18}/x_transformers/x_transformers.py +9 -0

@@ -275,6 +275,10 @@ class ReluSquared(Module):
     def forward(self, x):
         return F.relu(x) ** 2
 
+class SoLU(Module):
+    def forward(self, x):
+        return x.softmax(dim = -1) * x
+
 # embedding
 
 class TokenEmbedding(Module):

@@ -1239,6 +1243,7 @@ class FeedForward(Module):
         glu_mult_bias = False,
         swish = False,
         relu_squared = False,
+        solu = False,
         custom_activation = None,
         post_act_ln = False,
         dropout = 0.,

@@ -1250,10 +1255,14 @@ class FeedForward(Module):
         inner_dim = int(dim * mult)
         dim_out = default(dim_out, dim)
 
+        assert at_most_one_of(relu_squared, solu)
+
         if exists(custom_activation):
             activation = deepcopy(custom_activation)
         elif relu_squared:
             activation = ReluSquared()
+        elif solu:
+            activation = SoLU()
         elif swish:
             activation = nn.SiLU()
         else:
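The activation itself is small: SoLU (Softmax Linear Unit, per the citation added to the README) gates the input elementwise by its own softmax over the feature dimension. A minimal standalone sketch of that behavior, assuming only PyTorch and mirroring the `SoLU` module added above:

```python
import torch
from torch import nn

class SoLU(nn.Module):
    # Softmax Linear Unit: x * softmax(x), with the softmax taken over the
    # last (feature) dimension, following the Transformer Circuits article.
    def forward(self, x):
        return x.softmax(dim = -1) * x

x = torch.randn(2, 16, 256)
out = SoLU()(x)
assert out.shape == x.shape  # elementwise gating preserves the shape
```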