{"id":3678,"date":"2012-08-29T10:58:43","date_gmt":"2012-08-28T16:58:43","guid":{"rendered":"http:\/\/www.moonmile.net\/blog\/archives\/3678"},"modified":"2012-08-29T11:00:30","modified_gmt":"2012-08-29T02:00:30","slug":"c-htmldom-%e3%81%ae%e3%83%91%e3%83%bc%e3%82%b9%e9%83%a8%e5%88%86%e3%82%92-c-%e3%81%a7%e6%9b%b8%e3%81%8d%e7%9b%b4%e3%81%99","status":"publish","type":"post","link":"http:\/\/www.moonmile.net\/blog\/archives\/3678","title":{"rendered":"[C++] HtmlDom \u306e\u30d1\u30fc\u30b9\u90e8\u5206\u3092 C++ \u3067\u66f8\u304d\u76f4\u3059"},"content":{"rendered":"<p>HtmlDom \u306f LINQ to HTML \u3092\u76ee\u6307\u3057\u3066\u3044\u307e\u3059\u304c\u3001\u304b\u3064 HTML \u304c\u697d\u306b\u7de8\u96c6\u3067\u304d\u308b\u3088\u3046\u306b\u66f4\u65b0\u7cfb\uff08Update\/Delete\/Insert\u306a\u3069\uff09\u306e\u30e1\u30bd\u30c3\u30c9\u3082\u6e96\u5099\u3057\u307e\u3059\u3002<\/p>\n<p>\u307e\u3042\u3001\u5185\u90e8\u7684\u306b\u306f XML \u306b\u76f4\u3057\u3066\u3044\u308b\u306e\u3067\u64cd\u4f5c\u306f\u697d\u306a\u306e\u3067\u3059\u304c\u3001\u306a\u3093\u3068 HTML \u306e\u30d1\u30fc\u30b9\u90e8\u5206\u304c\u3061\u3068\u9762\u5012\u3067\u3002<\/p>\n<p>\u3082\u3068\u3082\u3068\u3042\u308b System.Forms.HtmlDocument \u81ea\u4f53\u306b\u306f\u3001Children \u306b\u76f8\u5f53\u3059\u308b\u30b3\u30ec\u30af\u30b7\u30e7\u30f3\u304c\u306a\u3044\u306e\u3067\u3001\u5168 DOM \u3092\u53d6\u308b\u3053\u3068\u304c\u3067\u304d\u306a\u3044\u3093\u3067\u3059\u3088\u306d\u3002<\/p>\n<p>HtmlDocument \u30af\u30e9\u30b9 (System.Windows.Forms)<br \/>\n<a href=\"http:\/\/msdn.microsoft.com\/ja-jp\/library\/system.windows.forms.htmldocument(v=vs.110).aspx\">http:\/\/msdn.microsoft.com\/ja-jp\/library\/system.windows.forms.htmldocument(v=vs.110).aspx<\/a><\/p>\n<p>\u30c8\u30ea\u30c3\u30ad\u30fc\u306a\u4f5c\u308a\u3092\u3059\u308c\u3070\u3001\u3053\u308c\u306b\u6cbf\u3063\u3066 LINQ \u3050\u3089\u3044\u306f\u4f5c\u308c\u308b\u306e\u3067\u3059\u304c\u3001\u3061\u3087\u3063\u3068\u4f7f\u3044\u3065\u3089\u3044\u3068\u3044\u3046\u3053\u3068\u3067\u3001\u72ec\u81ea\u306b HtmlDocument, HtmlNode \u3092\u4f5c\u3063\u3066\u3044\u307e\u3059\u3002<br \/>\n\u3053\u306e\u3068\u304d\u3001HTML \u6587\u5b57\u5217\u304b\u3089 COM \u306e IHTMLDocument2 \u3092\u4f7f\u3063\u3066\u30d1\u30fc\u30b9\u3059\u308b\u306e\u306f\u3053\u3093\u306a\u611f\u3058\u3002<\/p>\n<pre class=\"brush: csharp; title: ; notranslate\" title=\"\">\r\n\/\/\/ &lt;summary&gt;\r\n\/\/\/ Loading method\r\n\/\/\/ HtmlDocument to create a HTML string\r\n\/\/\/ &lt;\/summary&gt;\r\n\/\/\/ &lt;param name=&amp;quot;html&amp;quot;&gt;HTML string&lt;\/param&gt;\r\n\/\/\/ &lt;returns&gt;&lt;\/returns&gt;\r\npublic HtmlDocument LoadHtml(string html)\r\n{\r\n\t\/\/ Creating an object using a mshtml.HTMLDocument\r\n\tvar doc = new HTMLDocument() as IHTMLDocument2;\r\n\tdoc.write(new object&#x5B;] { html });\r\n\tLoad(doc);\r\n\treturn this;\r\n}\r\n<\/pre>\n<p>\u975e\u5e38\u306b\u7c21\u5358\u3067\u3001IHTMLDocument2 \u30a4\u30f3\u30bf\u30fc\u30d5\u30a7\u30fc\u30b9\u306b\u30ad\u30e3\u30b9\u30c8\u3057\u3066\u3001COM \u306e write \u30e1\u30bd\u30c3\u30c9\u3092\u547c\u3073\u51fa\u3059\u3060\u3051\u3067\u3059\u3002\u3053\u308c\u306f HTML DOM \u306e document.write \u306b\u5bfe\u5fdc\u3057\u3066\u3044\u308b\u306e\u3067\u3001javascript \u307e\u3067\u5b9f\u884c\u3055\u308c\u3066\u3057\u307e\u3046\u306e\u3067\u3059\u304c\u3001\u307e\u3041\u3001\u5927\u4e08\u592b\u307f\u305f\u3044\u3067\u3059\u3002\u4f55\u6545\u304b\u3001COM \u3067\u76f4\u63a5\u547c\u3073\u51fa\u3057\u305f\u6642\u306f\u3001javascript \u5b9f\u884c\u3067\u30a8\u30e9\u30fc\u306b\u306a\u308b\uff08\u753b\u9762\u306eUI\u30b3\u30f3\u30c8\u30ed\u30fc\u30eb\u3092\u63a2\u3057\u3066\u30a8\u30e9\u30fc\u306b\u306a\u308b\u3068\u3044\u3046\u4e0d\u5177\u5408\u306e\u3088\u3046\u3067\u3059\uff09\u3089\u3057\u3044\u306e\u3067\u3059\u304c\u3001\u5b9f\u306f\u5927\u4e08\u592b\u3067\u3059\u3002<\/p>\n<pre class=\"brush: csharp; title: ; notranslate\" title=\"\">\r\n\tCComPtr&lt;IHTMLDocument2&gt; pDoc;\r\n\tHRESULT hr = CoCreateInstance(CLSID_HTMLDocument, NULL, CLSCTX_INPROC_SERVER, IID_IHTMLDocument2, (void**)&amp;pDoc);\r\n\t\/\/put the code into SAFEARRAY and write it into document\r\n\tSAFEARRAY* psa = SafeArrayCreateVector(VT_VARIANT, 0, 1);\r\n\tVARIANT *param;\r\n\thr = SafeArrayAccessData(psa, (LPVOID*)&amp;param);\r\n\tparam-&gt;vt = VT_BSTR;\r\n\tparam-&gt;bstrVal = CComBSTR(strHTMLCode).Copy();\r\n\thr = pDoc-&gt;write(psa);\r\n\thr = pDoc-&gt;close();\r\n\tSafeArrayDestroy(psa);\r\n<\/pre>\n<p>ATL COM \u3092\u4f7f\u3063\u3066\u3044\u307e\u3059\u3002\u3069\u3063\u304b\u306e\u30b5\u30f3\u30d7\u30eb\u304b\u3089\u53d6\u3063\u3066\u304d\u305f\u306e\u3067\u3001\u5b9f\u306f SafeArrayCreateVector \u306f\u5fc5\u8981\u306a\u3044\u304b\u3082\u3057\u308c\u307e\u305b\u3093\u3002\u30b5\u30f3\u30d7\u30eb\u3092\u52d5\u4f5c\u3055\u305b\u308b\u3068\u30a8\u30e9\u30fc\u306b\u306a\u3063\u3066\u3044\u305f\u306e\u3067\u3059\u304c\u300cCComBSTR(strHTMLCode).Copy();\u300d\u306e\u3088\u3046\u306b\u3001\u4e00\u5ea6\u30b3\u30d4\u30fc\u3092\u53d6\u308b\u3053\u3068\u3067\u3001\u3046\u307e\u304f\u5b9f\u884c\u3067\u304d\u307e\u3059\u3002\u305f\u3081\u3057\u306b\u3001\u81ea\u5206\u306e twitter \u30b5\u30a4\u30c8\u304b\u3089 HTML \u3092\u30c0\u30a6\u30f3\u30ed\u30fc\u30c9\u3057\u3066\u6765\u3066\u30d1\u30fc\u30b9\u3059\u308b\u3068\u3046\u307e\u304f\u52d5\u304d\u307e\u3057\u305f\u3002twitter \u30b5\u30a4\u30c8\u306f javascript \u3092\u591a\u7528\u3057\u3066\u3044\u308b\u306e\u3067\u3001\u3053\u308c\u304c\u5927\u4e08\u592b\u306a\u3089\u3070\u5927\u62b5\u306e\u30b5\u30a4\u30c8\u306f\u5927\u4e08\u592b\u3060\u3068\u601d\u3044\u307e\u3059\u3002<\/p>\n<p>\u809d\u5fc3\u306e\u30b9\u30d4\u30fc\u30c9\u3067\u3059\u304c\u3001C# \u7d4c\u7531\u306e COM \u30a2\u30af\u30bb\u30b9\u306f\u4f55\u6545\u304b\u5c5e\u6027\u306e\u914d\u5217\u3092\u53d6\u308b\u3068\u3053\u308d\u304c\u975e\u5e38\u306b\u9045\u304f\u306a\u3063\u3066\u3044\u307e\u3059\u3002<\/p>\n<pre class=\"brush: csharp; title: ; notranslate\" title=\"\">\r\n\t\/\/ append attributes\r\n\tIHTMLAttributeCollection attrs = node.attributes;\r\n\tif (attrs != null)\r\n\t{\r\n\t\tforeach (IHTMLDOMAttribute at in attrs)\r\n\t\t{\r\n\t\t\tif (at.specified)\r\n\t\t\t{\r\n\t\t\t\tstring nodeValue = &quot;&quot;;\r\n\t\t\t\tif (at.nodeValue != null)\r\n\t\t\t\t\tnodeValue = at.nodeValue.ToString();\r\n\t\t\t\tnn.Attrs.Add(new HtmlAttr { Key = at.nodeName, Value = nodeValue });\r\n\t\t\t}\r\n\t\t}\r\n\t}\r\n<\/pre>\n<p>\u90e8\u5206\u7684\u306a\u30b3\u30fc\u30c9\u3067\u3059\u304c\u3001foreach \u306e\u30eb\u30fc\u30d7\u306e\u3068\u3053\u308d\u3067\u3001attrs \u306e\u8981\u7d20\u3092 150 \u7a0b\u5ea6\u5efb\u308b\u306e\u304c\u554f\u984c\u306a\u3088\u3046\u3067\u3059\u3002\u5b9f\u306f\u3001IHTMLDocument \u304c\u30d1\u30fc\u30b9\u3057\u305f\u5f8c\u306e\u8981\u7d20\u3067\u306f\u3001\u4f55\u6545\u304b\u5c5e\u6027\u306e\u30b3\u30ec\u30af\u30b7\u30e7\u30f3\u304c\u975e\u5e38\u306b\u305f\u304f\u3055\u3093\u7528\u610f\u3055\u308c\u3066\u3044\u308b\u306e\u3067\u3059\u3088\u306d\u3002\u304a\u305d\u3089\u304f onclick \u306a\u3069\u306e\u30d5\u30c3\u30af\u95a2\u6570\u306e\u305f\u3081\u306b\u7528\u610f\u3055\u308c\u3066\u3044\u308b\u3068\u601d\u3046\u306e\u3067\u3059\u304c\u3001\u9759\u7684\u306a\u30c7\u30fc\u30bf\u3092\u53d6\u308a\u305f\u3044\u5834\u5408\u306b\u306f\u3001\u3053\u308c\u304c\u4e0d\u8981\u3067\u3059\u3057\u975e\u5e38\u306b\u90aa\u9b54\u3067\u3059\u3002\u8981\u7d20\u6570\u304c2,3\u3057\u304b\u306a\u3044\u306e\u3067\u3001150\u7a0b\u5ea6\u306e\u30eb\u30fc\u30d7\u3092\u5efb\u3059\u3082\u306e\u3060\u304b\u3089\u3001\u8010\u3048\u304d\u308c\u306a\u3044\u4f4d\u304a\u305d\u304f\u306a\u308a\u307e\u3059\u3002<br \/>\n\u3053\u308c\u306f\u3001\u7c21\u5358\u306a HTML \u306e\u5834\u5408\u306b\u306f\u554f\u984c\u304c\u306a\u304f\u3066\u3001twitter \u306e HTML \u306e\u3088\u3046\u306b\u5927\u91cf\u306a HTML \u306e\u5834\u5408\u306b\u767a\u899a\u3057\u305f\u73fe\u8c61\u3067\u3059\u3002\u3053\u308c\u3067\u306f\u5b9f\u7528\u306b\u8010\u3048\u307e\u305b\u3093\u3002<\/p>\n<p>\u306a\u306e\u3067\u3001\u3058\u3083\u3042\u3001COM \u30a2\u30af\u30bb\u30b9\u81ea\u4f53\u3092\u9ad8\u901f\u5316\u3059\u308b\u305f\u3081\u306b\u3072\u3068\u307e\u305a\u3001C++ \u3067\u66f8\u3044\u3066\u307f\u305f\u306e\u304c\u4ee5\u4e0b\u3067\u3059\u3002<\/p>\n<pre class=\"brush: csharp; title: ; notranslate\" title=\"\">\r\nlist&lt;XAttr*&gt; *getAttrs( CComQIPtr&lt;IHTMLDOMNode&gt; node )\r\n{\r\n\tauto *xattrs = new list&lt;XAttr*&gt;();\r\n\r\n\tCComPtr&lt;IDispatch&gt; disp;\r\n\tnode-&gt;get_attributes( &amp;disp );\r\n\tCComQIPtr&lt;IHTMLAttributeCollection&gt; attrs = disp;\r\n\tif ( attrs ) {\r\n\t\tlong length = 0;\r\n\t\tattrs-&gt;get_length( &amp;length );\r\n\t\tCComPtr&lt;IDispatch&gt; dispa;\r\n\t\tfor ( int i=0; i&lt;length; i++ ) {\r\n\t\t\tCComVariant vt(i);\r\n\t\t\tattrs-&gt;item( &amp;vt, &amp;dispa );\r\n\t\t\tCComQIPtr&lt;IHTMLDOMAttribute&gt; attr = dispa ;\r\n\t\t\tif ( attr ) {\r\n\t\t\t\tVARIANT_BOOL vtb;\r\n\t\t\t\tattr-&gt;get_specified( &amp;vtb );\r\n\t\t\t\tif ( vtb ) {\r\n\t\t\t\t\tCComBSTR key;\r\n\t\t\t\t\tattr-&gt;get_nodeName( &amp;key );\r\n\t\t\t\t\tCComVariant value;\r\n\t\t\t\t\tattr-&gt;get_nodeValue( &amp;value );\r\n\r\n\t\t\t\t\txattrs-&gt;push_back(new XAttr( CString(key), CString(value.bstrVal)));\r\n\t\t\t\t}\r\n\t\t\t}\r\n\t\t\tattr.Release();\r\n\t\t\tdispa.Release();\r\n\t\t}\r\n\t}\r\n\tattrs.Release();\r\n\tdisp.Release();\r\n\treturn xattrs;\r\n}\r\n<\/pre>\n<p>\u30eb\u30fc\u30d7\u5909\u6570\u3068\u306a\u308b length \u306e\u5024\u306f 150 \u7a0b\u5ea6\u306a\u306e\u3067\u540c\u3058\u3050\u3089\u3044\u30eb\u30fc\u30d7\u304c\u5efb\u3063\u3066\u3044\u307e\u3059\u304c\u3001\u975e\u5e38\u306b\u9ad8\u901f\u306b\u52d5\u304d\u307e\u3059\u3002\u591a\u5206\u3001CComBSTR \u304b CComVariant \u3068 .NET \u3068\u306e\u76f8\u4e92\u5909\u63db\u306e\u90e8\u5206\u3067\u9045\u304f\u306a\u3063\u3066\u3044\u308b\u611f\u3058\u304c\u3057\u307e\u3059\u3002\u3053\u308c\u306f\u5f8c\u3067\u5b9f\u6e2c\u3057\u3066\u307f\u308b\u3064\u3082\u308a\u3067\u3059\u3002<\/p>\n<p>\u3068\u3044\u3046\u8a33\u3067\u3001IE \u3067\u4f7f\u3063\u3066\u3044\u308b IHTMLDocument2 \u3092\u76f4\u63a5\u4f7f\u3063\u3066\u81ea\u524d\u3067 DOM \u3092\u4f5c\u308b\u3053\u3068\u304c\u3067\u304d\u307e\u3057\u305f\u3002<br \/>\n\u3067\u3059\u304c\u3001\u3053\u306e\u30d1\u30fc\u30b9\u90e8\u5206\u306f C++ \u3068\u306a\u3063\u3066\u3044\u308b\u306e\u3067\u3001C# \u306e HtmlNode \u30aa\u30d6\u30b8\u30a7\u30af\u30c8\u306b\u3057\u306a\u3044\u3068 LINQ \u304c\u4f7f\u3048\u306a\u3044\u3067\u3059\u3088\u306d\u3002<\/p>\n<p>\u3063\u3066\u8a33\u3067\u3001C++\/CLI \u306e\u51fa\u756a\u306a\u3093\u3067\u3059\u3088\u3002\u3048\u3048\u3001VS2010 \u3067\u306f C++\/CLI \u306e\u30a4\u30f3\u30c6\u30ea\u30bb\u30f3\u30b9\u304c\u52b9\u304b\u306a\u3044\u306e\u3067\u3001VS2012 \u3067\u66f8\u3044\u305f\u3082\u306e\u3092 VS2010 \u306b\u623b\u3057\u307e\u3059\u3063\u3066\u306a\u611f\u3058\u3067\u3059\u3002\u672c\u5f53\u306f 2008 \u304c\u826f\u3044\u3093\u3067\u3059\u304c\u3001\u9593\u9055\u3063\u3066\u30a2\u30f3\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb\u3057\u3061\u3083\u3063\u305f\u3093\u3067\u3059\u3088\u306d\u3002\u306a\u306e\u3067\u3001\u4ed5\u65b9\u304c\u7121\u304f\u5225\u30de\u30b7\u30f3\u306e VS2012 \u3092\u501f\u308a\u308b\u3068\u3044\u3046\u7f70\u30b2\u30fc\u30e0\u306b\uff08VS2010 \u306b VS2012 \u3092\u5165\u308c\u308b\u3068 MSTest \u304c\u6b63\u5e38\u306b\u52d5\u304b\u306a\u3044\u3093\u3067\u3059\u304c\u3001\u3053\u308c\u306f RTM \u3067\u306a\u304a\u3063\u3066\u3044\u308b\u3093\u3067\u3057\u3087\u3046\u304b\uff1f\uff09<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>HtmlDom \u306f LINQ to HTML \u3092\u76ee\u6307\u3057\u3066\u3044\u307e\u3059\u304c\u3001\u304b\u3064 HTML \u304c\u697d\u306b\u7de8\u96c6\u3067\u304d\u308b\u3088\u3046\u306b\u66f4\u65b0\u7cfb\uff08Update\/Delete\/Insert\u306a\u3069\uff09\u306e\u30e1\u30bd\u30c3\u30c9\u3082\u6e96\u5099\u3057\u307e\u3059\u3002 \u307e\u3042\u3001\u5185\u90e8\u7684\u306b\u306f XML \u306b\u76f4\u3057\u3066\u3044 &hellip; <a href=\"http:\/\/www.moonmile.net\/blog\/archives\/3678\">\u7d9a\u304d\u3092\u8aad\u3080 <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[22],"tags":[],"class_list":["post-3678","post","type-post","status-publish","format-standard","hentry","category-c"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/posts\/3678","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/comments?post=3678"}],"version-history":[{"count":1,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/posts\/3678\/revisions"}],"predecessor-version":[{"id":3679,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/posts\/3678\/revisions\/3679"}],"wp:attachment":[{"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/media?parent=3678"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/categories?post=3678"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.moonmile.net\/blog\/wp-json\/wp\/v2\/tags?post=3678"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}