[Yanel-commits] rev 26847 - in
public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep:
. revisions revisions/1186048243090 revisions/1186048324300
michi at wyona.com
michi at wyona.com
Tue Aug 21 17:28:06 CEST 2007
Author: michi
Date: 2007-08-21 17:28:05 +0200 (Tue, 21 Aug 2007)
New Revision: 26847
Added:
public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/
public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048243090/
public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048243090/content
public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048243090/meta
public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048324300/
public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048324300/content
public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048324300/meta
Modified:
public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/meta
Log:
nutch versions added
Modified: public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/meta
===================================================================
--- public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/meta 2007-08-21 15:27:30 UTC (rev 26846)
+++ public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/meta 2007-08-21 15:28:05 UTC (rev 26847)
@@ -1 +1,5 @@
yarep_type<string>:resource
+yarep_isCheckedOut<boolean>:false
+yarep_checkoutDate<date>:2007-08-02T11:50:43+0200
+yarep_checkoutUserID<string>:michi
+yarep_checkinDate<date>:2007-08-02T11:52:04+0200
Added: public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048243090/content
===================================================================
--- public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048243090/content (rev 0)
+++ public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048243090/content 2007-08-21 15:28:05 UTC (rev 26847)
@@ -0,0 +1,31 @@
+<?xml version="1.0"?>
+
+<html xmlns="http://www.w3.org/1999/xhtml">
+<head>
+ <title>Nutch Resource</title>
+ <link rel="neutron-introspection" type="application/neutron+xml" href="?yanel.resource.usecase=introspection"/></head>
+
+ <body>
+ <h1>Nutch Resource</h1>
+ <h2>Crawling</h2>
+ <h3>Configuration of Crawler</h3>
+ <p>See also <a href="http://lucene.apache.org/nutch/tutorial8.html">http://lucene.apache.org/nutch/tutorial8.html</a> for more information.</p>
+ <ul>
+ <li>URLs to start with:<ul>
+ <li>e.g. nutch-0.8.x/url/yanel-website.txt (http://yanel.wyona.org/)</li>
+ <li>e.g. nutch-0.8.x/url/yulup-website.txt (http://www.yulup.org/)</li>
+ </ul>
+ </li>
+ <li>The range of crawling resp. URLs to be parsed and followed (IMPORTANT: Both files below need to have an "accept hosts" entry):<ul>
+ <li>nutch-0.8.x/conf/crawl-urlfilter.txt (+^http://yanel.wyona.org/)</li>
+ <li>nutch-0.8.x/conf/regex-urlfilter.txt (+^http://yanel.wyona.org/)</li>
+ </ul>
+ </li>
+ <li>Depth of Crawling: crawl.sh (e.g. DEPTH=5)</li>
+ </ul>
+ <h3>Running Crawler</h3>
+ <ul>
+ <li>sh crawl.sh</li>
+ </ul>
+ </body>
+</html>
Added: public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048243090/meta
===================================================================
--- public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048243090/meta (rev 0)
+++ public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048243090/meta 2007-08-21 15:28:05 UTC (rev 26847)
@@ -0,0 +1,7 @@
+yarep_revisionComment<string>:initial revision
+yarep_type<string>:resource
+yarep_revisionCreator<string>:michi
+yarep_isCheckedOut<boolean>:false
+yarep_checkoutDate<date>:2007-08-02T11:50:43+0200
+yarep_checkoutUserID<string>:michi
+yarep_revisionCreationDate<date>:2007-08-02T11:50:43+0200
Added: public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048324300/content
===================================================================
--- public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048324300/content (rev 0)
+++ public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048324300/content 2007-08-21 15:28:05 UTC (rev 26847)
@@ -0,0 +1,31 @@
+<?xml version="1.0"?>
+
+<html xmlns="http://www.w3.org/1999/xhtml">
+<head>
+ <title>Nutch Resource</title>
+ <link rel="neutron-introspection" type="application/neutron+xml" href="?yanel.resource.usecase=introspection"/></head>
+
+ <body>
+ <h1>Nutch Resource</h1>
+ <h2>Crawling</h2>
+ <h3>Configuration of Nutch Crawler</h3>
+ <p>See also <a href="http://lucene.apache.org/nutch/tutorial8.html">http://lucene.apache.org/nutch/tutorial8.html</a> for more information.</p>
+ <ul>
+ <li>URLs to start with:<ul>
+ <li>e.g. nutch-0.8.x/url/yanel-website.txt (http://yanel.wyona.org/)</li>
+ <li>e.g. nutch-0.8.x/url/yulup-website.txt (http://www.yulup.org/)</li>
+ </ul>
+ </li>
+ <li>The range of crawling resp. URLs to be parsed and followed (IMPORTANT: Both files below need to have an "accept hosts" entry):<ul>
+ <li>nutch-0.8.x/conf/crawl-urlfilter.txt (+^http://yanel.wyona.org/)</li>
+ <li>nutch-0.8.x/conf/regex-urlfilter.txt (+^http://yanel.wyona.org/)</li>
+ </ul>
+ </li>
+ <li>Depth of Crawling: crawl.sh (e.g. DEPTH=5)</li>
+ </ul>
+ <h3>Running Nutch Crawler</h3>
+ <ul>
+ <li>sh crawl.sh</li>
+ </ul><h2>Searching</h2><h3>Configuration of Yanel Nutch Resource</h3>...
+ </body>
+</html>
\ No newline at end of file
Added: public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048324300/meta
===================================================================
--- public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048324300/meta (rev 0)
+++ public/yanel/trunk/src/realms/yanel-website/content/db7296de-85c2-4723-96e0-01d94c464467.yarep/revisions/1186048324300/meta 2007-08-21 15:28:05 UTC (rev 26847)
@@ -0,0 +1,9 @@
+yarep_revisionComment<string>:updated
+yarep_type<string>:resource
+yarep_isCheckedOut<boolean>:false
+yarep_revisionCreator<string>:michi
+workflow-date<date>:2007-08-02T11:52:04+0200
+yarep_checkoutDate<date>:2007-08-02T11:50:43+0200
+yarep_checkoutUserID<string>:michi
+workflow-state<string>:draft
+yarep_revisionCreationDate<date>:2007-08-02T11:52:04+0200
More information about the Yanel-commits
mailing list