twitter-text-simpleidn 3.0.0.0

Sign up to get free protection for your applications and to get access to all the features.
@@ -0,0 +1,7 @@
1
+ ---
2
+ SHA256:
3
+ metadata.gz: 39d873d6e264da34a8103823492efd957bc0d6a2bda31335600030b200e59d28
4
+ data.tar.gz: 3ae9dfced1cfece5ae01e3c4c5abca19dab386666bb6fbce0eb871a9817541e6
5
+ SHA512:
6
+ metadata.gz: 65041db94fe061de3a3b21c036028fddb807f6ce0881f66494cbbf3fd08d9c358f5003cc8afa9ea93c21d8e3e7642a9283779eb2ccfad91d507077ecb99a642c
7
+ data.tar.gz: 6bae480be2ffa3aa40a75a3533fe64ca60b491863b5b661c02ca4ad9201ec5d4963cad0e140616498b4e036a05d7f5b4333a9c5d57a5480e2448ce27f5640a6a
File without changes
@@ -0,0 +1,40 @@
1
+ *.gem
2
+ *.rbc
3
+ *.sw[a-p]
4
+ *.tmproj
5
+ *.tmproject
6
+ *.un~
7
+ *~
8
+ .DS_Store
9
+ .Spotlight-V100
10
+ .Trashes
11
+ ._*
12
+ .bundle
13
+ .config
14
+ .directory
15
+ .elc
16
+ .emacs.desktop
17
+ .emacs.desktop.lock
18
+ .redcar
19
+ .yardoc
20
+ Desktop.ini
21
+ Gemfile.lock
22
+ Icon?
23
+ InstalledFiles
24
+ Session.vim
25
+ Thumbs.db
26
+ \#*\#
27
+ _yardoc
28
+ auto-save-list
29
+ coverage
30
+ doc
31
+ lib/bundler/man
32
+ pkg
33
+ pkg/*
34
+ rdoc
35
+ spec/reports
36
+ test/tmp
37
+ test/version_tmp
38
+ tmp
39
+ tmtags
40
+ tramp
@@ -0,0 +1,3 @@
1
+ [submodule "test/twitter-text-conformance"]
2
+ path = test/twitter-text-conformance
3
+ url = git://github.com/twitter/twitter-text-conformance.git
data/.rspec ADDED
@@ -0,0 +1,2 @@
1
+ --color
2
+ --format=documentation
@@ -0,0 +1,35 @@
1
+ # Changelog
2
+ All notable changes to this project will be documented in this file.
3
+
4
+ ## [3.0.0]
5
+ ### Added
6
+ - New v3.json config file with emojiParsingEnabled config option. When
7
+ true, twitter-text will parse and discount emoji supported by the
8
+ twemoji library (see https://github.com/twitter/twemoji). The length
9
+ of these emoji will be the default weight (200 or two characters) even
10
+ if they contain multiple code points combined by zero-width
11
+ joiners. This means that emoji with skin tone and gender modifiers no
12
+ longer count as more characters than those without such modifiers.
13
+ ### Changed
14
+ - Updates known gTLDs to recognize recent additions by IANA (#261)
15
+
16
+ ## [2.1] - 2017-12-20
17
+ ### Added
18
+ - This CHANGELOG.md file
19
+
20
+ ### Changed
21
+ - Top-level namespace changed from `Twitter` to `Twitter::TwitterText`. This
22
+ resolves a namespace collision with the popular
23
+ [twitter gem](https://github.com/sferik/twitter). This is considered
24
+ a breaking change, so the version has been bumped to 2.1. This fixes
25
+ issue [#221](https://github.com/twitter/twitter-text/issues/221),
26
+ "NoMethodError Exception: undefined method `[]' for nil:NilClasswhen
27
+ using gem in rails app"
28
+
29
+ ## [2.0.2] - 2017-12-18
30
+ ### Changed
31
+ - Resolved issue
32
+ [#211](https://github.com/twitter/twitter-text/issues/211), "gem
33
+ breaks, asset file is a dangling symlink"
34
+ - config files, tld_lib.yml files now copied into the right place
35
+ - Rakefile now included `prebuild`, `clean` tasks
data/Gemfile ADDED
@@ -0,0 +1,4 @@
1
+ source "http://rubygems.org"
2
+
3
+ # Specify the gem's dependencies in twitter-text.gemspec
4
+ gemspec
data/LICENSE ADDED
@@ -0,0 +1,188 @@
1
+ Copyright 2011 Twitter, Inc.
2
+
3
+ Licensed under the Apache License, Version 2.0 (the "License");
4
+ you may not use this work except in compliance with the License.
5
+ You may obtain a copy of the License below, or at:
6
+
7
+ http://www.apache.org/licenses/LICENSE-2.0
8
+
9
+ Unless required by applicable law or agreed to in writing, software
10
+ distributed under the License is distributed on an "AS IS" BASIS,
11
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
12
+ See the License for the specific language governing permissions and
13
+ limitations under the License.
14
+
15
+ Apache License
16
+ Version 2.0, January 2004
17
+ http://www.apache.org/licenses/
18
+
19
+ TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
20
+
21
+ 1. Definitions.
22
+
23
+ "License" shall mean the terms and conditions for use, reproduction,
24
+ and distribution as defined by Sections 1 through 9 of this document.
25
+
26
+ "Licensor" shall mean the copyright owner or entity authorized by
27
+ the copyright owner that is granting the License.
28
+
29
+ "Legal Entity" shall mean the union of the acting entity and all
30
+ other entities that control, are controlled by, or are under common
31
+ control with that entity. For the purposes of this definition,
32
+ "control" means (i) the power, direct or indirect, to cause the
33
+ direction or management of such entity, whether by contract or
34
+ otherwise, or (ii) ownership of fifty percent (50%) or more of the
35
+ outstanding shares, or (iii) beneficial ownership of such entity.
36
+
37
+ "You" (or "Your") shall mean an individual or Legal Entity
38
+ exercising permissions granted by this License.
39
+
40
+ "Source" form shall mean the preferred form for making modifications,
41
+ including but not limited to software source code, documentation
42
+ source, and configuration files.
43
+
44
+ "Object" form shall mean any form resulting from mechanical
45
+ transformation or translation of a Source form, including but
46
+ not limited to compiled object code, generated documentation,
47
+ and conversions to other media types.
48
+
49
+ "Work" shall mean the work of authorship, whether in Source or
50
+ Object form, made available under the License, as indicated by a
51
+ copyright notice that is included in or attached to the work
52
+ (an example is provided in the Appendix below).
53
+
54
+ "Derivative Works" shall mean any work, whether in Source or Object
55
+ form, that is based on (or derived from) the Work and for which the
56
+ editorial revisions, annotations, elaborations, or other modifications
57
+ represent, as a whole, an original work of authorship. For the purposes
58
+ of this License, Derivative Works shall not include works that remain
59
+ separable from, or merely link (or bind by name) to the interfaces of,
60
+ the Work and Derivative Works thereof.
61
+
62
+ "Contribution" shall mean any work of authorship, including
63
+ the original version of the Work and any modifications or additions
64
+ to that Work or Derivative Works thereof, that is intentionally
65
+ submitted to Licensor for inclusion in the Work by the copyright owner
66
+ or by an individual or Legal Entity authorized to submit on behalf of
67
+ the copyright owner. For the purposes of this definition, "submitted"
68
+ means any form of electronic, verbal, or written communication sent
69
+ to the Licensor or its representatives, including but not limited to
70
+ communication on electronic mailing lists, source code control systems,
71
+ and issue tracking systems that are managed by, or on behalf of, the
72
+ Licensor for the purpose of discussing and improving the Work, but
73
+ excluding communication that is conspicuously marked or otherwise
74
+ designated in writing by the copyright owner as "Not a Contribution."
75
+
76
+ "Contributor" shall mean Licensor and any individual or Legal Entity
77
+ on behalf of whom a Contribution has been received by Licensor and
78
+ subsequently incorporated within the Work.
79
+
80
+ 2. Grant of Copyright License. Subject to the terms and conditions of
81
+ this License, each Contributor hereby grants to You a perpetual,
82
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
83
+ copyright license to reproduce, prepare Derivative Works of,
84
+ publicly display, publicly perform, sublicense, and distribute the
85
+ Work and such Derivative Works in Source or Object form.
86
+
87
+ 3. Grant of Patent License. Subject to the terms and conditions of
88
+ this License, each Contributor hereby grants to You a perpetual,
89
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
90
+ (except as stated in this section) patent license to make, have made,
91
+ use, offer to sell, sell, import, and otherwise transfer the Work,
92
+ where such license applies only to those patent claims licensable
93
+ by such Contributor that are necessarily infringed by their
94
+ Contribution(s) alone or by combination of their Contribution(s)
95
+ with the Work to which such Contribution(s) was submitted. If You
96
+ institute patent litigation against any entity (including a
97
+ cross-claim or counterclaim in a lawsuit) alleging that the Work
98
+ or a Contribution incorporated within the Work constitutes direct
99
+ or contributory patent infringement, then any patent licenses
100
+ granted to You under this License for that Work shall terminate
101
+ as of the date such litigation is filed.
102
+
103
+ 4. Redistribution. You may reproduce and distribute copies of the
104
+ Work or Derivative Works thereof in any medium, with or without
105
+ modifications, and in Source or Object form, provided that You
106
+ meet the following conditions:
107
+
108
+ (a) You must give any other recipients of the Work or
109
+ Derivative Works a copy of this License; and
110
+
111
+ (b) You must cause any modified files to carry prominent notices
112
+ stating that You changed the files; and
113
+
114
+ (c) You must retain, in the Source form of any Derivative Works
115
+ that You distribute, all copyright, patent, trademark, and
116
+ attribution notices from the Source form of the Work,
117
+ excluding those notices that do not pertain to any part of
118
+ the Derivative Works; and
119
+
120
+ (d) If the Work includes a "NOTICE" text file as part of its
121
+ distribution, then any Derivative Works that You distribute must
122
+ include a readable copy of the attribution notices contained
123
+ within such NOTICE file, excluding those notices that do not
124
+ pertain to any part of the Derivative Works, in at least one
125
+ of the following places: within a NOTICE text file distributed
126
+ as part of the Derivative Works; within the Source form or
127
+ documentation, if provided along with the Derivative Works; or,
128
+ within a display generated by the Derivative Works, if and
129
+ wherever such third-party notices normally appear. The contents
130
+ of the NOTICE file are for informational purposes only and
131
+ do not modify the License. You may add Your own attribution
132
+ notices within Derivative Works that You distribute, alongside
133
+ or as an addendum to the NOTICE text from the Work, provided
134
+ that such additional attribution notices cannot be construed
135
+ as modifying the License.
136
+
137
+ You may add Your own copyright statement to Your modifications and
138
+ may provide additional or different license terms and conditions
139
+ for use, reproduction, or distribution of Your modifications, or
140
+ for any such Derivative Works as a whole, provided Your use,
141
+ reproduction, and distribution of the Work otherwise complies with
142
+ the conditions stated in this License.
143
+
144
+ 5. Submission of Contributions. Unless You explicitly state otherwise,
145
+ any Contribution intentionally submitted for inclusion in the Work
146
+ by You to the Licensor shall be under the terms and conditions of
147
+ this License, without any additional terms or conditions.
148
+ Notwithstanding the above, nothing herein shall supersede or modify
149
+ the terms of any separate license agreement you may have executed
150
+ with Licensor regarding such Contributions.
151
+
152
+ 6. Trademarks. This License does not grant permission to use the trade
153
+ names, trademarks, service marks, or product names of the Licensor,
154
+ except as required for reasonable and customary use in describing the
155
+ origin of the Work and reproducing the content of the NOTICE file.
156
+
157
+ 7. Disclaimer of Warranty. Unless required by applicable law or
158
+ agreed to in writing, Licensor provides the Work (and each
159
+ Contributor provides its Contributions) on an "AS IS" BASIS,
160
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
161
+ implied, including, without limitation, any warranties or conditions
162
+ of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
163
+ PARTICULAR PURPOSE. You are solely responsible for determining the
164
+ appropriateness of using or redistributing the Work and assume any
165
+ risks associated with Your exercise of permissions under this License.
166
+
167
+ 8. Limitation of Liability. In no event and under no legal theory,
168
+ whether in tort (including negligence), contract, or otherwise,
169
+ unless required by applicable law (such as deliberate and grossly
170
+ negligent acts) or agreed to in writing, shall any Contributor be
171
+ liable to You for damages, including any direct, indirect, special,
172
+ incidental, or consequential damages of any character arising as a
173
+ result of this License or out of the use or inability to use the
174
+ Work (including but not limited to damages for loss of goodwill,
175
+ work stoppage, computer failure or malfunction, or any and all
176
+ other commercial damages or losses), even if such Contributor
177
+ has been advised of the possibility of such damages.
178
+
179
+ 9. Accepting Warranty or Additional Liability. While redistributing
180
+ the Work or Derivative Works thereof, You may choose to offer,
181
+ and charge a fee for, acceptance of support, warranty, indemnity,
182
+ or other liability obligations and/or rights consistent with this
183
+ License. However, in accepting such obligations, You may act only
184
+ on Your own behalf and on Your sole responsibility, not on behalf
185
+ of any other Contributor, and only if You agree to indemnify,
186
+ defend, and hold each Contributor harmless for any liability
187
+ incurred by, or claims asserted against, such Contributor by reason
188
+ of your accepting any such warranty or additional liability.
@@ -0,0 +1,193 @@
1
+ # twitter-text
2
+
3
+ ![](https://img.shields.io/gem/v/twitter-text.svg)
4
+
5
+ This is the Ruby implementation of the twitter-text parsing
6
+ library. The library has methods to parse Tweets and calculate length,
7
+ validity, parse @mentions, #hashtags, URLs, and more.
8
+
9
+ ## Setup
10
+
11
+ Installation uses bundler.
12
+
13
+ ```
14
+ % gem install bundler
15
+ % bundle install
16
+ ```
17
+
18
+ ## Conformance tests
19
+
20
+ To run the Conformance test suite from the command line via rake:
21
+
22
+ ```
23
+ % rake test:conformance:run
24
+ ```
25
+
26
+ You can also run the rspec tests in the `spec` directory:
27
+
28
+ ```
29
+ % rspec spec
30
+ ```
31
+
32
+ # Length validation
33
+
34
+ twitter-text 2.0 introduces configuration files that define how Tweets
35
+ are parsed for length. This allows for backwards compatibility and
36
+ flexibility going forward. Old-style traditional 140-character parsing
37
+ is defined by the v1.json configuration file, whereas v2.json is
38
+ updated for "weighted" Tweets where ranges of Unicode code points can
39
+ have independent weights aside from the default weight. The sum of all
40
+ code points, each weighted appropriately, should not exceed the max
41
+ weighted length.
42
+
43
+ Some old methods from twitter-text 1.0 have been marked deprecated,
44
+ such as the `tweet_length()` method. The new API is based on the
45
+ following method, `parse_tweet()`
46
+
47
+ ```ruby
48
+ def parse_tweet(text, options = {}) { ... }
49
+ ```
50
+
51
+ This method takes a string as input and returns a results object that
52
+ contains information about the
53
+ string. `Twitter::TwitterText::Validation::ParseResults` object includes:
54
+
55
+ * `:weighted_length`: the overall length of the tweet with code points
56
+ weighted per the ranges defined in the configuration file.
57
+
58
+ * `:permillage`: indicates the proportion (per thousand) of the weighted
59
+ length in comparison to the max weighted length. A value > 1000
60
+ indicates input text that is longer than the allowable maximum.
61
+
62
+ * `:valid`: indicates if input text length corresponds to a valid
63
+ result.
64
+
65
+ * `:display_range_start, :display_range_end`: An array of two unicode code point
66
+ indices identifying the inclusive start and exclusive end of the
67
+ displayable content of the Tweet. For more information, see
68
+ the description of `display_text_range` here:
69
+ [Tweet updates](https://developer.twitter.com/en/docs/tweets/tweet-updates)
70
+
71
+ * `:valid_range_start, :valid_range_end`: An array of two unicode code point
72
+ indices identifying the inclusive start and exclusive end of the valid
73
+ content of the Tweet. For more information on the extended Tweet
74
+ payload see [Tweet updates](https://developer.twitter.com/en/docs/tweets/tweet-updates)
75
+
76
+ ## Extraction Examples
77
+
78
+ # Extraction
79
+ ```ruby
80
+ class MyClass
81
+ include Twitter::TwitterText::Extractor
82
+ usernames = extract_mentioned_screen_names("Mentioning @twitter and @jack")
83
+ # usernames = ["twitter", "jack"]
84
+ end
85
+ ```
86
+
87
+ ### Extraction with a block argument
88
+
89
+ ```ruby
90
+ class MyClass
91
+ include Twitter::TwitterText::Extractor
92
+ extract_reply_screen_name("@twitter are you hiring?").do |username|
93
+ # username = "twitter"
94
+ end
95
+ end
96
+ ```
97
+
98
+ ## Auto-linking Examples
99
+
100
+ ### Auto-link
101
+
102
+ ```ruby
103
+ class MyClass
104
+ include Twitter::TwitterText::Autolink
105
+
106
+ html = auto_link("link @user, please #request")
107
+ end
108
+ ```
109
+
110
+ ### For Ruby on Rails you want to add this to app/helpers/application_helper.rb
111
+ ```ruby
112
+ module ApplicationHelper
113
+ include Twitter::TwitterText::Autolink
114
+ end
115
+ ```
116
+
117
+ ### Now the auto_link function is available in every view. So in index.html.erb:
118
+ ```ruby
119
+ <%= auto_link("link @user, please #request") %>
120
+ ```
121
+
122
+ ### Usernames
123
+
124
+ Username extraction and linking matches all valid Twitter usernames but does
125
+ not verify that the username is a valid Twitter account.
126
+
127
+ ### Lists
128
+
129
+ Auto-link and extract list names when they are written in @user/list-name
130
+ format.
131
+
132
+ ### Hashtags
133
+
134
+ Auto-link and extract hashtags, where a hashtag can contain most letters or
135
+ numbers but cannot be solely numbers and cannot contain punctuation.
136
+
137
+ ### URLs
138
+
139
+ Asian languages like Chinese, Japanese or Korean may not use a delimiter such
140
+ as a space to separate normal text from URLs making it difficult to identify
141
+ where the URL ends and the text starts.
142
+
143
+ For this reason twitter-text currently does not support extracting or
144
+ auto-linking of URLs immediately followed by non-Latin characters.
145
+
146
+ Example: "http://twitter.com/は素晴らしい" . The normal text is "は素晴らしい" and is not
147
+ part of the URL even though it isn't space separated.
148
+
149
+ ### International
150
+
151
+ Special care has been taken to be sure that auto-linking and extraction work
152
+ in Tweets of all languages. This means that languages without spaces between
153
+ words should work equally well.
154
+
155
+ ### Hit Highlighting
156
+
157
+ Use to provide emphasis around the "hits" returned from the Search API, built
158
+ to work against text that has been auto-linked already.
159
+
160
+ ## Issues
161
+
162
+ Have a bug? Please create an issue here on GitHub!
163
+
164
+ <https://github.com/twitter/twitter-text/issues>
165
+
166
+ ## Authors
167
+
168
+ ### V2.0
169
+
170
+ * David LaMacchia (<https://github.com/dlamacchia>)
171
+ * Yoshimasa Niwa (<https://github.com/niw>)
172
+ * Sudheer Guntupalli (<https://github.com/sudhee>)
173
+ * Kaushik Lakshmikanth (<https://github.com/kaushlakers>)
174
+ * Jose Antonio Marquez Russo (<https://github.com/joseeight>)
175
+ * Lee Adams (<https://github.com/leeaustinadams>)
176
+
177
+ ### Previous authors
178
+
179
+ * Matt Sanford (<http://github.com/mzsanford>)
180
+ * Raffi Krikorian (<http://github.com/r>)
181
+ * Ben Cherry (<http://github.com/bcherry>)
182
+ * Patrick Ewing (<http://github.com/hoverbird>)
183
+ * Jeff Smick (<http://github.com/sprsquish>)
184
+ * Kenneth Kufluk (<https://github.com/kennethkufluk>)
185
+ * Keita Fujii (<https://github.com/keitaf>)
186
+ * Jean-Philippe Bougie (<http://github.com/jpbougie>)
187
+ * Erik Michaels-Ober (<https://github.com/sferik>)
188
+
189
+ ## License
190
+
191
+ Copyright 2012-2018 Twitter, Inc and other contributors
192
+
193
+ Licensed under the [Apache License, Version 2.0](http://www.apache.org/licenses/LICENSE-2.0)