arxiv-to-prompt 0.1.1.tar.gz → 0.2.1.tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (15)

{arxiv_to_prompt-0.1.1/src/arxiv_to_prompt.egg-info → arxiv_to_prompt-0.2.1}/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
-Metadata-Version: 2.2
+Metadata-Version: 2.4
 Name: arxiv-to-prompt
-Version: 0.1.1
+Version: 0.2.1
 Summary: transform arXiv papers into a single latex prompt for LLMs
 Author: Takashi Ishida
 License: MIT
@@ -15,15 +15,16 @@ Requires-Dist: requests>=2.25.0
 Provides-Extra: test
 Requires-Dist: pytest>=7.0.0; extra == "test"
 Requires-Dist: pytest-cov>=4.0.0; extra == "test"
+Dynamic: license-file
 # arxiv-to-prompt
-[![PyPI version](https://badge.fury.io/py/arxiv-to-prompt.svg?update=20250202)](https://pypi.org/project/arxiv-to-prompt/)
+[![PyPI version](https://badge.fury.io/py/arxiv-to-prompt.svg?update=20250307)](https://pypi.org/project/arxiv-to-prompt/)
 [![Tests](https://github.com/takashiishida/arxiv-to-prompt/actions/workflows/tests.yml/badge.svg)](https://github.com/takashiishida/arxiv-to-prompt/actions)
 [![License](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
 [![Changelog](https://img.shields.io/github/v/release/takashiishida/arxiv-to-prompt?label=changelog)](https://github.com/takashiishida/arxiv-to-prompt/releases)
-A command-line tool to transform arXiv papers into a single LaTeX source that can be used as a prompt for asking LLMs questions about the paper. It downloads the source files, automatically finds the main tex file containing `\documentclass`, and flattens multiple files into a single coherent source by resolving `\input` and `\include` commands. The tool also provides an option to remove LaTeX comments from the output (which can be useful to shorten the prompt).
+A command-line tool to transform arXiv papers into a single LaTeX source that can be used as a prompt for asking LLMs questions about the paper. It downloads the source files, automatically finds the main tex file containing `\documentclass`, and flattens multiple files into a single coherent source by resolving `\input` and `\include` commands. The tool also provides options to remove LaTeX comments and appendix sections from the output (which can be useful to shorten the prompt).
 ### Installation
@@ -41,6 +42,12 @@ arxiv-to-prompt 2303.08774
 # Display LaTeX source without comments
 arxiv-to-prompt 2303.08774 --no-comments
+# Display LaTeX source without appendix sections
+arxiv-to-prompt 2303.08774 --no-appendix
+# Combine options (no comments and no appendix)
+arxiv-to-prompt 2303.08774 --no-comments --no-appendix
 # Copy to clipboard
 arxiv-to-prompt 2303.08774 | pbcopy
@@ -62,8 +69,23 @@ latex_source = process_latex_source("2303.08774")
 # Get LaTeX source without comments
 latex_source = process_latex_source("2303.08774", keep_comments=False)
+# Get LaTeX source without appendix sections
+latex_source = process_latex_source("2303.08774", remove_appendix_section=True)
+# Combine options (no comments and no appendix)
+latex_source = process_latex_source("2303.08774", keep_comments=False, remove_appendix_section=True)
 ```
+### Projects Using arxiv-to-prompt
+Here are some projects and use cases that leverage arxiv-to-prompt:
+- [arxiv-latex-mcp](https://github.com/takashiishida/arxiv-latex-mcp): MCP server that uses arxiv-to-prompt to fetch and process arXiv LaTeX sources for precise interpretation of mathematical expressions in scientific papers.
+- [arxiv-tex-ui](https://github.com/takashiishida/arxiv-tex-ui): chat with an llm about an arxiv paper by using the latex source.
+If you're using arxiv-to-prompt in your project, please submit a pull request to add it to this list!
 ### References
 - Inspired by [files-to-prompt](https://github.com/simonw/files-to-prompt).

{arxiv_to_prompt-0.1.1 → arxiv_to_prompt-0.2.1}/README.md RENAMED

@@ -1,11 +1,11 @@
 # arxiv-to-prompt
-[![PyPI version](https://badge.fury.io/py/arxiv-to-prompt.svg?update=20250202)](https://pypi.org/project/arxiv-to-prompt/)
+[![PyPI version](https://badge.fury.io/py/arxiv-to-prompt.svg?update=20250307)](https://pypi.org/project/arxiv-to-prompt/)
 [![Tests](https://github.com/takashiishida/arxiv-to-prompt/actions/workflows/tests.yml/badge.svg)](https://github.com/takashiishida/arxiv-to-prompt/actions)
 [![License](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
 [![Changelog](https://img.shields.io/github/v/release/takashiishida/arxiv-to-prompt?label=changelog)](https://github.com/takashiishida/arxiv-to-prompt/releases)
-A command-line tool to transform arXiv papers into a single LaTeX source that can be used as a prompt for asking LLMs questions about the paper. It downloads the source files, automatically finds the main tex file containing `\documentclass`, and flattens multiple files into a single coherent source by resolving `\input` and `\include` commands. The tool also provides an option to remove LaTeX comments from the output (which can be useful to shorten the prompt).
+A command-line tool to transform arXiv papers into a single LaTeX source that can be used as a prompt for asking LLMs questions about the paper. It downloads the source files, automatically finds the main tex file containing `\documentclass`, and flattens multiple files into a single coherent source by resolving `\input` and `\include` commands. The tool also provides options to remove LaTeX comments and appendix sections from the output (which can be useful to shorten the prompt).
 ### Installation
@@ -23,6 +23,12 @@ arxiv-to-prompt 2303.08774
 # Display LaTeX source without comments
 arxiv-to-prompt 2303.08774 --no-comments
+# Display LaTeX source without appendix sections
+arxiv-to-prompt 2303.08774 --no-appendix
+# Combine options (no comments and no appendix)
+arxiv-to-prompt 2303.08774 --no-comments --no-appendix
 # Copy to clipboard
 arxiv-to-prompt 2303.08774 | pbcopy
@@ -44,9 +50,24 @@ latex_source = process_latex_source("2303.08774")
 # Get LaTeX source without comments
 latex_source = process_latex_source("2303.08774", keep_comments=False)
+# Get LaTeX source without appendix sections
+latex_source = process_latex_source("2303.08774", remove_appendix_section=True)
+# Combine options (no comments and no appendix)
+latex_source = process_latex_source("2303.08774", keep_comments=False, remove_appendix_section=True)
 ```
+### Projects Using arxiv-to-prompt
+Here are some projects and use cases that leverage arxiv-to-prompt:
+- [arxiv-latex-mcp](https://github.com/takashiishida/arxiv-latex-mcp): MCP server that uses arxiv-to-prompt to fetch and process arXiv LaTeX sources for precise interpretation of mathematical expressions in scientific papers.
+- [arxiv-tex-ui](https://github.com/takashiishida/arxiv-tex-ui): chat with an llm about an arxiv paper by using the latex source.
+If you're using arxiv-to-prompt in your project, please submit a pull request to add it to this list!
 ### References
 - Inspired by [files-to-prompt](https://github.com/simonw/files-to-prompt).
-- Reused some code from [paper2slides](https://github.com/takashiishida/paper2slides).
+- Reused some code from [paper2slides](https://github.com/takashiishida/paper2slides).

{arxiv_to_prompt-0.1.1 → arxiv_to_prompt-0.2.1}/pyproject.toml RENAMED

@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 [project]
 name = "arxiv-to-prompt"
-version = "0.1.1"
+version = "0.2.1"
 description = "transform arXiv papers into a single latex prompt for LLMs"
 readme = "README.md"
 authors = [{ name = "Takashi Ishida" }]

{arxiv_to_prompt-0.1.1 → arxiv_to_prompt-0.2.1}/src/arxiv_to_prompt/cli.py RENAMED

@@ -22,13 +22,19 @@ def main():
         help=f"Custom directory to store downloaded files (default: {default_cache})",
         default=None
     )
+    parser.add_argument(
+        "--no-appendix",
+        action="store_true",
+        help="Remove the appendix section and everything after it"
+    )
     args = parser.parse_args()
     content = process_latex_source(
         args.arxiv_id,
         keep_comments=not args.no_comments,
-        cache_dir=args.cache_dir
+        cache_dir=args.cache_dir,
+        remove_appendix_section=args.no_appendix
     )
     if content:
         print(content)
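
The flag wiring in the hunk above can be sketched in isolation. This is a minimal illustration rather than the package's actual `cli.py`: the helper names `build_parser` and `to_kwargs` are hypothetical, and the definition of `--no-comments` as a `store_true` flag is an assumption, since only its use (`args.no_comments`) appears in the diff.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical stand-in for the parser built inside main()."""
    parser = argparse.ArgumentParser(prog="arxiv-to-prompt")
    parser.add_argument("arxiv_id")
    # Assumed definition; the diff only shows args.no_comments being read.
    parser.add_argument("--no-comments", action="store_true")
    # Matches the argument added in 0.2.1.
    parser.add_argument(
        "--no-appendix",
        action="store_true",
        help="Remove the appendix section and everything after it",
    )
    return parser


def to_kwargs(argv: list) -> dict:
    """Translate CLI flags into the keyword arguments passed downstream."""
    args = build_parser().parse_args(argv)
    return {
        # The CLI flag is negative, the keyword is positive, hence the inversion.
        "keep_comments": not args.no_comments,
        # Here both names are "remove"-oriented, so the value passes straight through.
        "remove_appendix_section": args.no_appendix,
    }


if __name__ == "__main__":
    print(to_kwargs(["2303.08774", "--no-appendix"]))
```

argparse converts the dashes in `--no-appendix` to underscores, which is why the value is read as `args.no_appendix`.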

{arxiv_to_prompt-0.1.1 → arxiv_to_prompt-0.2.1}/src/arxiv_to_prompt/core.py RENAMED

@@ -140,6 +140,14 @@ def remove_comments_from_lines(text: str) -> str:
         result.append(''.join(cleaned_line).rstrip())
     return '\n'.join(result)
+def remove_appendix(text: str) -> str:
+    """Remove appendix section and everything after it."""
+    # Find the position of \appendix command
+    appendix_match = re.search(r'\\appendix\b', text)
+    if appendix_match:
+        return text[:appendix_match.start()].rstrip()
+    return text
 def flatten_tex(directory: str, main_file: str) -> str:
     """Combine all tex files into one, resolving inputs."""
     def process_file(file_path: str, processed_files: set) -> str:
@@ -184,7 +192,8 @@ def flatten_tex(directory: str, main_file: str) -> str:
                 # Process the command normally
                 input_file = match.group(1)
-                if not input_file.endswith('.tex'):
+                # Only add .tex extension if the file has no extension at all
+                if not os.path.splitext(input_file)[1]:
                     input_file += '.tex'
                 input_path = os.path.join(directory, input_file)
                 return process_file(input_path, processed_files)
@@ -201,7 +210,7 @@ def flatten_tex(directory: str, main_file: str) -> str:
 def process_latex_source(arxiv_id: str, keep_comments: bool = True,
                         cache_dir: Optional[str] = None,
-                        use_cache: bool = False) -> Optional[str]:
+                        use_cache: bool = False, remove_appendix_section: bool = False) -> Optional[str]:
     """
     Process LaTeX source files from arXiv and return the combined content.
@@ -210,6 +219,7 @@ def process_latex_source(arxiv_id: str, keep_comments: bool = True,
         keep_comments: Whether to keep LaTeX comments in the output
         cache_dir: Custom directory to store downloaded files
         use_cache: Whether to use cached files if they exist (default: False)
+        remove_appendix_section: Whether to remove the appendix section and everything after it
     Returns:
         The processed LaTeX content or None if processing fails
@@ -234,6 +244,10 @@ def process_latex_source(arxiv_id: str, keep_comments: bool = True,
     if not keep_comments:
         content = remove_comments_from_lines(content)
+    # Remove appendix if requested
+    if remove_appendix_section:
+        content = remove_appendix(content)
     return content
 def check_source_available(arxiv_id: str) -> bool:
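
A few consequences of the `remove_appendix` regex added above may be worth seeing concretely. The sketch below copies the function body from the diff so that it runs standalone; the sample strings are made up for illustration.

```python
import re


def remove_appendix(text: str) -> str:
    """Copy of the function added in 0.2.1, reproduced so the sketch is standalone."""
    appendix_match = re.search(r'\\appendix\b', text)
    if appendix_match:
        return text[:appendix_match.start()].rstrip()
    return text


# Everything from the first \appendix onward is dropped, including \end{document}.
doc = "Body\n\\appendix\n\\section{Extra}\n\\end{document}"
print(repr(remove_appendix(doc)))  # 'Body'

# The \b word boundary keeps longer macro names such as \appendixname intact.
print(repr(remove_appendix("See \\appendixname here")))  # 'See \\appendixname here'

# A commented-out \appendix still matches; the regex does not look at comments.
print(repr(remove_appendix("Body\n% \\appendix\nMore")))  # 'Body\n%'
```

Because `process_latex_source` strips comments before it removes the appendix, the third case only arises when `keep_comments=True`. In that mode, a commented-out `\appendix` earlier in the file would truncate the output at that point.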

{arxiv_to_prompt-0.1.1 → arxiv_to_prompt-0.2.1/src/arxiv_to_prompt.egg-info}/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
-Metadata-Version: 2.2
+Metadata-Version: 2.4
 Name: arxiv-to-prompt
-Version: 0.1.1
+Version: 0.2.1
 Summary: transform arXiv papers into a single latex prompt for LLMs
 Author: Takashi Ishida
 License: MIT
@@ -15,15 +15,16 @@ Requires-Dist: requests>=2.25.0
 Provides-Extra: test
 Requires-Dist: pytest>=7.0.0; extra == "test"
 Requires-Dist: pytest-cov>=4.0.0; extra == "test"
+Dynamic: license-file
 # arxiv-to-prompt
-[![PyPI version](https://badge.fury.io/py/arxiv-to-prompt.svg?update=20250202)](https://pypi.org/project/arxiv-to-prompt/)
+[![PyPI version](https://badge.fury.io/py/arxiv-to-prompt.svg?update=20250307)](https://pypi.org/project/arxiv-to-prompt/)
 [![Tests](https://github.com/takashiishida/arxiv-to-prompt/actions/workflows/tests.yml/badge.svg)](https://github.com/takashiishida/arxiv-to-prompt/actions)
 [![License](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
 [![Changelog](https://img.shields.io/github/v/release/takashiishida/arxiv-to-prompt?label=changelog)](https://github.com/takashiishida/arxiv-to-prompt/releases)
-A command-line tool to transform arXiv papers into a single LaTeX source that can be used as a prompt for asking LLMs questions about the paper. It downloads the source files, automatically finds the main tex file containing `\documentclass`, and flattens multiple files into a single coherent source by resolving `\input` and `\include` commands. The tool also provides an option to remove LaTeX comments from the output (which can be useful to shorten the prompt).
+A command-line tool to transform arXiv papers into a single LaTeX source that can be used as a prompt for asking LLMs questions about the paper. It downloads the source files, automatically finds the main tex file containing `\documentclass`, and flattens multiple files into a single coherent source by resolving `\input` and `\include` commands. The tool also provides options to remove LaTeX comments and appendix sections from the output (which can be useful to shorten the prompt).
 ### Installation
@@ -41,6 +42,12 @@ arxiv-to-prompt 2303.08774
 # Display LaTeX source without comments
 arxiv-to-prompt 2303.08774 --no-comments
+# Display LaTeX source without appendix sections
+arxiv-to-prompt 2303.08774 --no-appendix
+# Combine options (no comments and no appendix)
+arxiv-to-prompt 2303.08774 --no-comments --no-appendix
 # Copy to clipboard
 arxiv-to-prompt 2303.08774 | pbcopy
@@ -62,8 +69,23 @@ latex_source = process_latex_source("2303.08774")
 # Get LaTeX source without comments
 latex_source = process_latex_source("2303.08774", keep_comments=False)
+# Get LaTeX source without appendix sections
+latex_source = process_latex_source("2303.08774", remove_appendix_section=True)
+# Combine options (no comments and no appendix)
+latex_source = process_latex_source("2303.08774", keep_comments=False, remove_appendix_section=True)
 ```
+### Projects Using arxiv-to-prompt
+Here are some projects and use cases that leverage arxiv-to-prompt:
+- [arxiv-latex-mcp](https://github.com/takashiishida/arxiv-latex-mcp): MCP server that uses arxiv-to-prompt to fetch and process arXiv LaTeX sources for precise interpretation of mathematical expressions in scientific papers.
+- [arxiv-tex-ui](https://github.com/takashiishida/arxiv-tex-ui): chat with an llm about an arxiv paper by using the latex source.
+If you're using arxiv-to-prompt in your project, please submit a pull request to add it to this list!
 ### References
 - Inspired by [files-to-prompt](https://github.com/simonw/files-to-prompt).

{arxiv_to_prompt-0.1.1 → arxiv_to_prompt-0.2.1}/tests/test_core.py RENAMED

@@ -9,6 +9,7 @@ from arxiv_to_prompt.core import (
     remove_comments_from_lines,
     check_source_available,
     flatten_tex,
+    remove_appendix,
 )
 # Test fixtures
@@ -176,3 +177,97 @@ Text with escaped \\% and then % \\input{commented_file3}
     assert "\\include{commented_file2}" in result
     assert "\\input{commented_file3}" in result
     assert "\\input{nonexistent_file}" in result
+def test_remove_appendix():
+    """Test appendix removal functionality."""
+    test_cases = [
+        # Basic appendix removal
+        (
+            "Main content\n\n\\appendix\nAppendix content",
+            "Main content"
+        ),
+        # No appendix to remove
+        (
+            "Main content only",
+            "Main content only"
+        ),
+        # Appendix with sections
+        (
+            "Introduction\n\\section{Method}\nContent\n\\appendix\n\\section{Additional Info}\nMore stuff",
+            "Introduction\n\\section{Method}\nContent"
+        ),
+        # Multiple appendix commands (should remove from first one)
+        (
+            "Content\n\\appendix\nFirst appendix\n\\appendix\nSecond appendix",
+            "Content"
+        ),
+        # Appendix at the beginning
+        (
+            "\\appendix\nAll appendix content",
+            ""
+        ),
+    ]
+    for input_text, expected in test_cases:
+        result = remove_appendix(input_text)
+        assert result == expected, f"Failed for input: {input_text}"
+def test_process_latex_with_appendix_removal(sample_arxiv_id, temp_cache_dir):
+    """Test processing LaTeX source with appendix removal."""
+    # Test with appendix removal
+    result = process_latex_source(
+        sample_arxiv_id,
+        keep_comments=True,
+        cache_dir=str(temp_cache_dir),
+        remove_appendix_section=True
+    )
+    assert result is not None
+    assert "\\documentclass" in result
+    # Check that appendix was removed (if it existed)
+    assert "\\appendix" not in result
+def test_input_file_extensions(temp_cache_dir):
+    """Test that input files with existing extensions are not modified."""
+    # Create test directory and files
+    tex_dir = temp_cache_dir / "test_extensions"
+    tex_dir.mkdir(parents=True)
+    # Create main file with various input commands
+    main_file = tex_dir / "main.tex"
+    main_content = """\\documentclass{article}
+\\begin{document}
+\\input{chapter1}
+\\input{main.bbl}
+\\input{mystyle.sty}
+\\input{config.cls}
+\\input{already.tex}
+\\end{document}
+"""
+    main_file.write_text(main_content)
+    # Create the files that should be included
+    files_to_create = [
+        ("chapter1.tex", "Chapter 1 content"),
+        ("main.bbl", "Bibliography content"),
+        ("mystyle.sty", "Style content"),
+        ("config.cls", "Class content"),
+        ("already.tex", "Already tex content"),
+    ]
+    for filename, content in files_to_create:
+        file_path = tex_dir / filename
+        file_path.write_text(content)
+    # Run the flatten_tex function
+    result = flatten_tex(str(tex_dir), "main.tex")
+    # Check that all files were included correctly
+    assert "Chapter 1 content" in result
+    assert "Bibliography content" in result
+    assert "Style content" in result
+    assert "Class content" in result
+    assert "Already tex content" in result
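
The extension rule that `test_input_file_extensions` exercises can be sketched on its own. The helper name `resolve_input_name` is hypothetical (in the package the check sits inline in `flatten_tex`); only the `os.path.splitext` condition is taken from the diff.

```python
import os


def resolve_input_name(input_file: str) -> str:
    """Hypothetical helper mirroring the 0.2.1 check: append .tex only when
    the name has no extension at all."""
    if not os.path.splitext(input_file)[1]:
        input_file += ".tex"
    return input_file


for name in ["chapter1", "main.bbl", "mystyle.sty", "already.tex", "sections/intro.v2"]:
    print(name, "->", resolve_input_name(name))
```

Under the 0.1.1 check (`not input_file.endswith('.tex')`), `main.bbl` would have become `main.bbl.tex` and the file would not have been found. One trade-off of the new rule, under this reading: a name containing a dot, such as the made-up `sections/intro.v2`, counts as already having an extension, so `.tex` is not appended to it.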