[Yanel-dev] Yarep Search and Tika
Michael Wechner
michael.wechner at wyona.com
Thu May 22 00:10:40 CEST 2008
Josias Thöny wrote:
> Michael Wechner wrote:
>
>> Hi
>>
>> I have noticed that
>>
>> src/impl/java/org/wyona/yarep/impl/repo/vfs/VirtualFileSystemOutputStream.java
>>
>>
>> is using Tika
>>
>> TikaConfig tikaConfig = TikaConfig.getDefaultConfig()
>>
>> but is it possible that Tika doesn't cover the mime type
>> application/xhtml+xml yet.
>>
>> If so, how can we best change this?
>
>
> Currently yarep is using the default tika config:
> TikaConfig tikaConfig = TikaConfig.getDefaultConfig();
>
> This uses the file /org/apache/tika/tika-config.xml contained in the
> jar file, which contains the mimetype/parser mappings.
>
> I guess to add a mimetype to that file it would be necessary to make a
> copy of the file and create a new TikaConfig(InputStream is) from the
> yarep-specific tika-config.xml.
done. One can now patch the default tika config by setting
<search-index src="index" index-fulltext="yes" index-properties="yes"
local-tika-config="tika-config.xml">
within the repo configuration
Cheers
Michi
>
> josias
>
>>
>> Thanks
>>
>> Michi
>>
>
> _______________________________________________
> Yanel-development mailing list
> Yanel-development at wyona.com
> http://lists.wyona.org/cgi-bin/mailman/listinfo/yanel-development
--
Michael Wechner
Wyona - Open Source Content Management - Yanel, Yulup
http://www.wyona.com
michael.wechner at wyona.com, michi at apache.org
+41 44 272 91 61
More information about the Yanel-development
mailing list