judgeval 0.0.1__py3-none-any.whl → 0.0.3__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: judgeval
3
- Version: 0.0.1
3
+ Version: 0.0.3
4
4
  Summary: Judgeval Package
5
5
  Project-URL: Homepage, https://github.com/JudgmentLabs/judgeval
6
6
  Project-URL: Issues, https://github.com/JudgmentLabs/judgeval/issues
@@ -10,19 +10,18 @@ License-File: LICENSE.md
10
10
  Classifier: Operating System :: OS Independent
11
11
  Classifier: Programming Language :: Python :: 3
12
12
  Requires-Python: >=3.11
13
- Requires-Dist: anthropic>=0.43.1
14
- Requires-Dist: deepeval>=2.1.8
15
- Requires-Dist: fastapi>=0.115.6
13
+ Requires-Dist: anthropic
14
+ Requires-Dist: fastapi
16
15
  Requires-Dist: langfuse==2.50.3
17
- Requires-Dist: litellm>=1.48.10
18
- Requires-Dist: openai>=1.47.1
19
- Requires-Dist: pandas>=2.2.3
20
- Requires-Dist: patronus>=0.0.17
16
+ Requires-Dist: litellm
17
+ Requires-Dist: openai
18
+ Requires-Dist: pandas
19
+ Requires-Dist: patronus
21
20
  Requires-Dist: python-dotenv==1.0.1
22
- Requires-Dist: requests>=2.32.3
23
- Requires-Dist: supabase>=2.11.0
24
- Requires-Dist: together>=1.3.11
25
- Requires-Dist: uvicorn>=0.34.0
21
+ Requires-Dist: requests
22
+ Requires-Dist: supabase
23
+ Requires-Dist: together
24
+ Requires-Dist: uvicorn
26
25
  Provides-Extra: dev
27
26
  Requires-Dist: pytest-asyncio>=0.25.0; extra == 'dev'
28
27
  Requires-Dist: pytest-mock>=3.14.0; extra == 'dev'
@@ -31,10 +30,4 @@ Description-Content-Type: text/markdown
31
30
 
32
31
  # judgeval
33
32
 
34
-
35
- TODOS
36
-
37
- 1. public interface for Example and Measurement objects
38
- 2. call to backend
39
- 3. datasets and logging
40
- 4. exporting to platform
33
+ Judgeval is a open-source evaluation framework for multi-agent LLM workflows, for both real-time and offline evaluations.
@@ -40,7 +40,7 @@ judgeval/scorers/judgeval_scorers/hallucination.py,sha256=7omZbrQWTgZ4Fnw8wQSgvs
40
40
  judgeval/scorers/judgeval_scorers/json_correctness.py,sha256=guGR5lgTtwG8qqVI7dFt6hHGEYM98XzZhqPzh69Ogqg,866
41
41
  judgeval/scorers/judgeval_scorers/summarization.py,sha256=OwF-sNtRg2HN6FoRCCYTow_SNfS9Rcxzb7ZDBTEld7k,445
42
42
  judgeval/scorers/judgeval_scorers/tool_correctness.py,sha256=7nAXrDKfPkeMzRLiNTevbUV5pjRXPTM2dbkfGRgSJ-s,452
43
- judgeval-0.0.1.dist-info/METADATA,sha256=4TZnY-nRVAgCKd9e-Ok_qr-B25sB39HYMvl-THUgHzg,1278
44
- judgeval-0.0.1.dist-info/WHEEL,sha256=qtCwoSJWgHk21S1Kb4ihdzI2rlJ1ZKaIurTj_ngOhyQ,87
45
- judgeval-0.0.1.dist-info/licenses/LICENSE.md,sha256=tKmCg7k5QOmxPK19XMfzim04QiQJPmgIm0pAn55IJwk,11352
46
- judgeval-0.0.1.dist-info/RECORD,,
43
+ judgeval-0.0.3.dist-info/METADATA,sha256=akLBzfSQQy8aTNLIuq0LR0M8mdWTM07Sdx7nEyU4FoA,1156
44
+ judgeval-0.0.3.dist-info/WHEEL,sha256=qtCwoSJWgHk21S1Kb4ihdzI2rlJ1ZKaIurTj_ngOhyQ,87
45
+ judgeval-0.0.3.dist-info/licenses/LICENSE.md,sha256=tKmCg7k5QOmxPK19XMfzim04QiQJPmgIm0pAn55IJwk,11352
46
+ judgeval-0.0.3.dist-info/RECORD,,