cseg 0.0.4 → 0.0.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (4) hide show
  1. data/README.md +10 -3
  2. data/lib/cseg/version.rb +1 -1
  3. data/lib/cseg.rb +1 -0
  4. metadata +2 -2
data/README.md CHANGED
@@ -18,17 +18,24 @@ Or install it yourself as:
18
18
 
19
19
  $ gem install cseg
20
20
 
21
- you need to install CRF++ first and set the environment environment variables.
21
+ you need to install CRF++ first and set the environment variables.
22
22
 
23
23
  Here is the site of CRF++<http://crfpp.googlecode.com/svn/trunk/doc/index.html> and you should follow the manual
24
24
 
25
25
  On github the dictionary file was deleted since it is quite large, though you can get all from rubygems.
26
+
27
+ ## recall and Precision
28
+
29
+ Tested on seghanbakeoff pku test set
30
+
31
+ Precision: 94.43%
32
+
33
+ Recall: 92.86%
26
34
 
27
35
  ## Usage
28
36
 
29
37
  require "cseg"
30
- a=Kurumi.segment("屌丝是一种自我讽刺")
31
-
38
+ a=Kurumi.segment("屌丝是一种自我讽刺")
32
39
  =>屌丝/是/一/种/自我/讽刺
33
40
 
34
41
  the result will be an array.
data/lib/cseg/version.rb CHANGED
@@ -1,3 +1,3 @@
1
1
  module Cseg
2
- VERSION = "0.0.4"
2
+ VERSION = "0.0.5"
3
3
  end
data/lib/cseg.rb CHANGED
@@ -29,6 +29,7 @@ class Kurumi
29
29
  else
30
30
  if word!=""
31
31
  @result.push(word)
32
+ word=""
32
33
  end
33
34
  end
34
35
 
metadata CHANGED
@@ -1,7 +1,7 @@
1
1
  --- !ruby/object:Gem::Specification
2
2
  name: cseg
3
3
  version: !ruby/object:Gem::Version
4
- version: 0.0.4
4
+ version: 0.0.5
5
5
  prerelease:
6
6
  platform: ruby
7
7
  authors:
@@ -9,7 +9,7 @@ authors:
9
9
  autorequire:
10
10
  bindir: bin
11
11
  cert_chain: []
12
- date: 2014-02-17 00:00:00.000000000 Z
12
+ date: 2014-02-25 00:00:00.000000000 Z
13
13
  dependencies: []
14
14
  description: ! '"a chinese segmentation tool using CRF++"'
15
15
  email: