tex_to_unicode 0.1.1 → 0.1.3

This diff shows the contents of publicly available package versions released to one of the supported registries. It is provided for informational purposes only and reflects the changes between versions as they appear in their public registries.

Files changed (4)

checksums.yaml +4 -4
data/README.md +14 -8
data/lib/tex_to_unicode/converter.rb +10 -0
metadata +3 -3

checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz: 8bb4dbd6c674b51978403322fa03e4855e4c597b8d29d2fe0ecd9a180042e1ed
-  data.tar.gz: fd8ee12ca84e575d3bde91d2f43f3940b7ebffb6b1b3a6aa0dbf2cf581fa087e
+  metadata.gz: 0d252355bf7d6843ae911db512e147434cad4d9e27232b7fc490be36b5347fd9
+  data.tar.gz: b440147cc001af9fae4dff17197c3c1fa42a97aaa047d4a79ab7ba59d680d7a3
 SHA512:
-  metadata.gz: 7c68e1d0ea72a31de50dcd8f3c67e6da4d14fd7a35bb174652e5174e41ac68f61970247ce8cc621b58277a4b84791f1038aa6c9848a72758bdaa3dc3831e6e4e
-  data.tar.gz: f179beb2acebf430c0a6eed4ac2163106d65d9667313e9a78c3e797cc973840f4e868f809205c792a2a753e7c13c4c93602f43a8640772220a046feeb3a7ce5e
+  metadata.gz: 6b9701d7934b7c210ce07b1970e4eda2c84eb02b436356cc41e9751927b00e67f35f3bddebf770ddd1cfe62262eb35139c53e82a7b07ba7ffa0e54f394e19b9c
+  data.tar.gz: 93e1cfb198af6380791ad579b5ca133a7527cf3055f66260b35f2d818a9032973d89fd618e46ae6298834db15eb1b53afca40d68baf495c73cc79a8af06f08eb
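
The digests recorded above can be checked against a locally unpacked archive. The following is a minimal sketch and not part of the gem: `sha256_matches?` is a hypothetical helper name, and it assumes the `checksums.yaml` text and the file to verify are already available on disk.

```ruby
require "digest"
require "yaml"

# Hypothetical helper (not from tex_to_unicode): returns true when the SHA256
# of the file at file_path equals the digest recorded for entry_name under
# the "SHA256" key of a checksums.yaml document.
def sha256_matches?(checksums_yaml, entry_name, file_path)
  expected = YAML.safe_load(checksums_yaml).dig("SHA256", entry_name)
  return false if expected.nil?

  # Digest::SHA256.file streams the file instead of loading it into memory.
  Digest::SHA256.file(file_path).hexdigest == expected
end
```

A `.gem` file is a tar archive that holds `metadata.gz`, `data.tar.gz`, and `checksums.yaml.gz`, so after unpacking one a call might look like `sha256_matches?(File.read("checksums.yaml"), "data.tar.gz", "data.tar.gz")`. The paths in that call are placeholders.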

data/README.md CHANGED

@@ -42,7 +42,7 @@ tex_to_unicode 'A \rightarrow B \Rightarrow C'
 # Integrals and sums
 tex_to_unicode '\int_0^\infty x^2 dx'
-# Output: ∫₀^∞ x² dx
+# Output: ∫₀∞ x² dx
 # Set notation
 tex_to_unicode 'x \in \mathbb{R}, y \notin \emptyset'
@@ -65,9 +65,9 @@ result = TexToUnicode.convert('\alpha + \beta = \gamma')
 puts result  # => α + β = γ
 # Use in string interpolation
-formula = '\sum_{i=1}^n i = \frac{n(n+1)}{2}'
+formula = '\sum_{i=1}^n i'
 puts "The formula is: #{TexToUnicode.convert(formula)}"
-# Output: The formula is: ∑ᵢ₌₁ⁿ i = n(n+1)/2
+# Output: The formula is: ∑i=1ⁿ i
 ```
 ## Supported Symbols
@@ -102,9 +102,11 @@ The gem supports a wide range of TeX symbols including:
 - `\emptyset` (∅), `\therefore` (∴), `\because` (∵)
 ### Superscripts & Subscripts
-- `^0` through `^9` (⁰¹²³⁴⁵⁶⁷⁸⁹)
-- `_0` through `_9` (₀₁₂₃₄₅₆₇₈₉)
-- `^+`, `^-`, `^(`, `^)` and subscript equivalents
+- Superscripts: `^0` through `^9` (⁰¹²³⁴⁵⁶⁷⁸⁹), `^i` (ⁱ), `^n` (ⁿ)
+- Superscript symbols: `^+` (⁺), `^-` (⁻), `^=` (⁼), `^(` (⁽), `^)` (⁾)
+- Subscripts: `_0` through `_9` (₀₁₂₃₄₅₆₇₈₉)
+- Subscript symbols: `_+` (₊), `_-` (₋), `_=` (₌), `_(` (₍), `_)` (₎)
+- Note: Unicode has limited super/subscript characters; unsupported characters will display normally
 ### Brackets
 - `\langle`, `\rangle` (⟨⟩)
@@ -125,7 +127,7 @@ tex_to_unicode 'x = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}'
 # Euler's identity
 tex_to_unicode 'e^{i\pi} + 1 = 0'
-# Output: eⁱᵖⁱ + 1 = 0
+# Output: eⁱπ + 1 = 0
 # Set theory
 tex_to_unicode 'A \cup B = \{x : x \in A \lor x \in B\}'
@@ -142,7 +144,11 @@ While Unicode provides many mathematical symbols, some TeX constructs cannot be
 - Complex fractions are approximated
 - Matrices and arrays have limited support
 - Some accents and diacritics are approximated
-- Subscripts and superscripts support limited characters
+- **Subscripts and superscripts support limited characters** due to Unicode constraints:
+  - **Supported superscripts**: digits (⁰¹²³⁴⁵⁶⁷⁸⁹), letters i and n (ⁱⁿ), and symbols ⁺⁻⁼⁽⁾
+  - **Supported subscripts**: digits (₀₁₂₃₄₅₆₇₈₉) and symbols ₊₋₌₍₎
+  - Unsupported characters (like `^\infty`, `^\alpha`, `_\beta`) will have the `^` or `_` marker removed and display as regular characters
+  - Example: `\int_0^\infty` becomes `∫₀∞` (not `∫₀^∞` with superscript infinity, as Unicode has no superscript ∞)
 ## Development
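
The supported set listed in the updated README can be written as two lookup tables with a fallback to the plain character. This is a sketch of that rule only: the constant names and `render_script` are hypothetical and do not come from the gem, whose actual `TEX_TO_UNICODE` table is not shown in this diff.

```ruby
# Characters the README lists as having Unicode superscript forms:
# digits, the letters i and n, and the symbols + - = ( ).
SUPERSCRIPTS = "0123456789in+-=()".chars.zip("⁰¹²³⁴⁵⁶⁷⁸⁹ⁱⁿ⁺⁻⁼⁽⁾".chars).to_h.freeze

# Characters the README lists as having Unicode subscript forms:
# digits and the symbols + - = ( ).
SUBSCRIPTS = "0123456789+-=()".chars.zip("₀₁₂₃₄₅₆₇₈₉₊₋₌₍₎".chars).to_h.freeze

# Hypothetical helper: map one character after a "^" or "_" marker to its
# Unicode form, or return the character unchanged when no form is listed.
def render_script(marker, char)
  table =
    case marker
    when "^" then SUPERSCRIPTS
    when "_" then SUBSCRIPTS
    else raise ArgumentError, "marker must be ^ or _"
    end
  table.fetch(char, char)
end

puts render_script("^", "n")  # listed superscript
puts render_script("^", "∞")  # no listed superscript, printed as-is
```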

data/lib/tex_to_unicode/converter.rb CHANGED

@@ -117,11 +117,21 @@ module TexToUnicode
     def self.convert(text)
       result = text.dup
+      # Remove braces used for grouping first (before conversions)
+      # This allows super/subscripts within braces to be processed
+      result.gsub!(/\{([^}]*)\}/) { $1 }
       # Sort by length (descending) to match longer patterns first
       TEX_TO_UNICODE.keys.sort_by { |k| -k.length }.each do |tex|
         result.gsub!(tex, TEX_TO_UNICODE[tex])
       end
+      # Remove unsupported superscript/subscript markers
+      # If a ^ or _ is still present after conversion, it means that character
+      # doesn't have a Unicode super/subscript equivalent, so we remove the marker
+      result.gsub!(/\^(.)/) { $1 }  # Remove ^ before any remaining character
+      result.gsub!(/_(.)/) { $1 }   # Remove _ before any remaining character
       result
     end
   end
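
To see how the three steps in the new `convert` interact, the sketch below repeats them against a small stand-in table. `SKETCH_TABLE` and `sketch_convert` are hypothetical names, and the table contains only the handful of entries needed for the README examples; the gem's real `TEX_TO_UNICODE` table is larger and is not part of this diff.

```ruby
# Stand-in for the gem's table (assumption: only these entries are needed here).
SKETCH_TABLE = {
  '\infty' => "∞", '\sum' => "∑", '\int' => "∫", '\pi' => "π",
  "^n" => "ⁿ", "^i" => "ⁱ", "^2" => "²", "_0" => "₀"
}.freeze

def sketch_convert(text)
  result = text.dup

  # Step 1: drop grouping braces so "^{i\pi}" becomes "^i\pi".
  result.gsub!(/\{([^}]*)\}/) { $1 }

  # Step 2: replace table entries, longest keys first.
  SKETCH_TABLE.keys.sort_by { |k| -k.length }.each do |tex|
    result.gsub!(tex, SKETCH_TABLE[tex])
  end

  # Step 3: strip any "^" or "_" marker that step 2 left behind.
  result.gsub!(/\^(.)/) { $1 }
  result.gsub!(/_(.)/) { $1 }
  result
end

puts sketch_convert('\int_0^\infty x^2 dx')  # ∫₀∞ x² dx
puts sketch_convert('e^{i\pi} + 1 = 0')      # eⁱπ + 1 = 0
puts sketch_convert('\sum_{i=1}^n i')        # ∑i=1ⁿ i
```

With this stand-in table the outputs agree with the updated README examples. One consequence of step 3 worth noting is that it applies to every remaining underscore or caret followed by a character, so under the same assumptions `sketch_convert('a_b')` returns `ab`.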

metadata CHANGED

@@ -1,7 +1,7 @@
 --- !ruby/object:Gem::Specification
 name: tex_to_unicode
 version: !ruby/object:Gem::Version
-  version: 0.1.1
+  version: 0.1.3
 platform: ruby
 authors:
 - Thomas Powell
@@ -29,8 +29,8 @@ homepage: https://github.com/stringsn88keys/tex_to_unicode
 licenses:
 - MIT
 metadata:
-  source_code_uri: https://github.com/yourusername/tex_to_unicode
-  bug_tracker_uri: https://github.com/yourusername/tex_to_unicode/issues
+  source_code_uri: https://github.com/stringsn88keys/tex_to_unicode
+  bug_tracker_uri: https://github.com/stringsn88keys/tex_to_unicode/issues
 post_install_message:
 rdoc_options: []
 require_paths: