{"id":860,"date":"2010-05-21T09:53:01","date_gmt":"2010-05-21T00:53:01","guid":{"rendered":"http:\/\/www.moonmile.net\/blog\/archives\/860"},"modified":"2012-01-11T15:06:29","modified_gmt":"2012-01-11T06:06:29","slug":"%e6%8c%87%e5%ae%9a%e3%81%97%e3%81%9ftwitter%e3%82%a2%e3%82%ab%e3%82%a6%e3%83%b3%e3%83%88%e3%81%ae%e5%85%a8%e3%83%84%e3%82%a4%e3%83%bc%e3%83%88%e3%82%92%e5%8f%96%e5%be%97perl%e7%89%88","status":"publish","type":"post","link":"http:\/\/www.moonmile.net\/blog\/archives\/860","title":{"rendered":"Fetching all tweets from a specified Twitter account (Perl version)"},"content":{"rendered":"<p>This is a Perl script that fetches every post (tweet) from a Twitter account you specify.<\/p>\n<p>As for what it is good for:<\/p>\n<ul>\n<li>checking whether your boyfriend is cheating on you<\/li>\n<\/ul>\n<p>would be one use (lol), or you could do some serious analysis. In my case, I plan to use it in connection with \u300c\u7d61\u3063\u305f\u30fc\u300d, which I wrote about earlier.<\/p>\n<p>The Twitter API is rate-limited (around 500 requests per hour, I think), so the script pulls the pages directly from the official site <a href=\"http:\/\/twitter.com\/\">http:\/\/twitter.com\/<\/a> 
instead. That means it will stop working if the official site changes its markup, but for now, take it as an example of how tweets can be fetched this way.<\/p>\n<p>To install:<\/p>\n<ul>\n<li>Download ActivePerl or a similar Perl distribution<\/li>\n<li>Download wget<br \/>\nIf you know what you are doing, you can switch to cURL<\/li>\n<\/ul>\n<p>From the command line, run<\/p>\n<p>perl krmall.pl [account]<\/p>\n<p>and the script creates<\/p>\n<ul>\n<li>a file of all tweets: [account].txt<\/li>\n<li>a file of the people the account interacts with: [account]_st.txt<\/li>\n<\/ul>\n<p>That is about all there is to it; you just open them in Notepad. Converting them to HTML is handy because you can then follow the links.<\/p>\n<p>For example, to fetch all of Masayoshi Son's tweets,<\/p>\n<blockquote><p><span style=\"background-color: #ffffff;\">perl krmall.pl masason 
<\/span><\/p><\/blockquote>\n<p>is the command to run.<\/p>\n<p>It takes a fair amount of time, but it is still more convenient than clicking through the website page by page.<\/p>\n<p>So, how does the script fetch the tweets? It works like this:<\/p>\n<ol>\n<li>Get the tweet count of the specified account<br \/>\n<a href=\"http:\/\/twitter.com\/\">http:\/\/twitter.com\/<\/a>[account]<\/li>\n<li>Compute the total number of pages = tweet count \/ 20 (rounded down) + 1.<\/li>\n<li>Fetch the older tweets one page at a time<br \/>\n<a href=\"http:\/\/twitter.com\/\">http:\/\/twitter.com\/<\/a>[account]?page=n<\/li>\n<li>Search each page for the ID, tweet text, and so on, and extract them.<\/li>\n<\/ol>\n<p>That is the gist of it. It is the same as ordinary crawling through a website.<\/p>\n<p>The source code follows.<\/p>\n<pre class=\"brush: perl; title: ; notranslate\" title=\"\">\r\n# Fetch every tweet from the specified account\r\n$user = $ARGV&#x5B;0]; # account name\r\n$wget = &quot;wget&quot;;\r\n\r\nif ( $user eq &quot;&quot; ) {\r\nprint &quot;perl krmall.pl &#x5B;account]&quot;;\r\nexit;\r\n}\r\n\r\n# Get the current tweet count\r\n`$wget http:\/\/twitter.com\/$user -Otemp.txt`;\r\nopen( FILE, &quot;&lt;temp.txt&quot; );\r\nwhile(&lt;FILE&gt;) {\r\nif ( \/&lt;span id=&quot;update_count&quot; 
class=&quot;stat_count&quot;&gt;(&#x5B;0-9,]+)&lt;\\\/span&gt;\/ ) {\r\n$cnt = $1;\r\n$cnt =~ s\/,\/\/g; # strip every thousands separator\r\nlast; # stop at the first match\r\n}\r\n}\r\nclose( FILE );\r\n\r\n$pmax = int($cnt\/20)+1;\r\nprint &quot;account: $user count: $cnt pages: $pmax\\n&quot;;\r\n\r\n# Read every page for the specified account\r\nunlink( &quot;$user.txt&quot; );\r\nopen( OUT, &quot;&gt;&gt;$user.txt&quot; );\r\n\r\nfor ($i=1; $i&lt;=$pmax; $i++ ) {\r\n`$wget http:\/\/twitter.com\/$user?page=$i -Otemp.txt`;\r\n\r\nopen( FILE, &quot;&lt;temp.txt&quot; );\r\nwhile(&lt;FILE&gt;) {\r\nif ( \/&lt;span class=&quot;entry-content&quot;&gt;\/ ) {\r\n$text = $_;\r\n# The tweet may span several lines; read up to the closing tag\r\nif ( !\/&lt;\\\/span&gt;\/ ) {\r\nwhile(&lt;FILE&gt;) {\r\nif ( \/&lt;\\\/span&gt;\/ ) {\r\n$text .= $_;\r\nlast;\r\n}\r\n$text .= $_;\r\n}\r\n}\r\n$text =~ s\/\\n\/\/g;\r\n$text =~ s\/\\r\/\/g;\r\n$text =~ \/&lt;span class=&quot;entry-content&quot;&gt;(.*)&lt;\\\/span&gt;\/;\r\n$text = $1;\r\n\r\n# Skip two lines, then read the status ID and the timestamp\r\n&lt;FILE&gt;; &lt;FILE&gt;;\r\n$id = &lt;FILE&gt;;\r\n$id =~ \/status\\\/(&#x5B;0-9]+)\/;\r\n$id = $1;\r\n$date = &lt;FILE&gt;;\r\n$date =~ \/data=&quot;\\{time:&#039;(&#x5B;^&#039;]+)&#039;}\/; #&quot;\r\n$date = $1;\r\n\r\n# Remove any remaining HTML tags from the tweet text\r\n$text =~ s\/&lt;&#x5B;^&gt;]+&gt;\/\/g;\r\n\r\nprint OUT &quot;---\\n&quot;;\r\nprint OUT &quot;$id\\n&quot;;\r\nprint OUT &quot;$text\\n&quot;;\r\nprint OUT &quot;$date\\n\\n&quot;;\r\n}\r\n}\r\nclose( FILE );\r\n}\r\nclose( OUT ); # &quot;\r\n\r\n# Write the mention statistics\r\nopen( OUT, &quot;&lt;$user.txt&quot;);\r\nopen( ST, &quot;&gt;${user}_st.txt&quot;);\r\nwhile(&lt;OUT&gt;) {\r\nif ( \/^---\/ ) {\r\n$_ = &lt;OUT&gt;; chomp; $id = $_;\r\n$_ = &lt;OUT&gt;; chomp; $text = $_;\r\n$_ = &lt;OUT&gt;; chomp; $date = $_;\r\n\r\n# Count each @mention found in the tweet text\r\n$_ = $text ;\r\n@res = \/(@&#x5B;A-Za-z0-9_]+)\/g;\r\nforeach $re ( @res ) {\r\n$users{ $re } = $users{ $re } + 1;\r\n}\r\n}\r\n}\r\nclose( OUT );\r\n\r\n# List mentioned accounts, most frequent first\r\nforeach $re 
( sort {$users{$b} &lt;=&gt; $users{$a}} keys %users ) {\r\nif ( $re ne &quot;@&quot;.$user ) {\r\nprint ST &quot;$re (&quot;.$users{$re}.&quot;)\\n&quot;;\r\n}\r\n}\r\nclose( ST );\r\n<\/pre>\n<p>&#8212;<br \/>\nUpdate, 2012\/01\/11<\/p>\n<p>The code above does not work with the latest official client, so please try the version below instead. It is written in C#.<\/p>\n<p>Fetching all tweets from a specified Twitter account (provisional .NET version) | Moonmile Solutions Blog<br \/>\n<a href=\"http:\/\/www.moonmile.net\/blog\/archives\/2851\">http:\/\/www.moonmile.net\/blog\/archives\/2851<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>This is a Perl script that fetches every post (tweet) from a Twitter account you specify. As for what it is good for: checking whether your boyfriend is cheating on you would be one use (lol), or you could do some serious analysis. In my case, in connection with \u300c\u7d61\u3063\u305f\u30fc\u300d, which I wrote about earlier, &hellip; <a href=\"http:\/\/www.moonmile.net\/blog\/archives\/860\">Continue reading <span 
class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[3],"tags":[],"class_list":["post-860","post","type-post","status-publish","format-standard","hentry","category-dev"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/posts\/860","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/comments?post=860"}],"version-history":[{"count":4,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/posts\/860\/revisions"}],"predecessor-version":[{"id":862,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/posts\/860\/revisions\/862"}],"wp:attachment":[{"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/media?parent=860"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/categories?post=860"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/tags?post=860"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}