[Yanel-dev] Yarep Search and Tika

Michael Wechner michael.wechner at wyona.com
Thu May 22 00:10:40 CEST 2008


Josias Thöny wrote:

> Michael Wechner wrote:
>
>> Hi
>>
>> I have noticed that
>>
>> src/impl/java/org/wyona/yarep/impl/repo/vfs/VirtualFileSystemOutputStream.java 
>>
>>
>> is using Tika
>>
>> TikaConfig tikaConfig = TikaConfig.getDefaultConfig()
>>
>> but is it possible that Tika doesn't cover the  mime type 
>> application/xhtml+xml yet.
>>
>> If so, how can we best change this?
>
>
> Currently yarep is using the default tika config:
> TikaConfig tikaConfig = TikaConfig.getDefaultConfig();
>
> This uses the file /org/apache/tika/tika-config.xml contained in the 
> jar file, which contains the mimetype/parser mappings.
>
> I guess to add a mimetype to that file it would be necessary to make a 
> copy of the file and create a new TikaConfig(InputStream is) from the 
> yarep-specific tika-config.xml.


done. One can now patch the default tika config by setting

  <search-index src="index" index-fulltext="yes" index-properties="yes" 
local-tika-config="tika-config.xml">

within the repo configuration

Cheers

Michi

>
> josias
>
>>
>> Thanks
>>
>> Michi
>>
>
> _______________________________________________
> Yanel-development mailing list
> Yanel-development at wyona.com
> http://lists.wyona.org/cgi-bin/mailman/listinfo/yanel-development



-- 
Michael Wechner
Wyona      -   Open Source Content Management - Yanel, Yulup
http://www.wyona.com
michael.wechner at wyona.com, michi at apache.org
+41 44 272 91 61



More information about the Yanel-development mailing list