English and German Forums for YaCy: uncensorable, untraceable search engines and freedom of information.
apt-get install nodejs
tar -axf LanguageDetect.tar.gz
davide hat geschrieben:Any progress?
Language detection is a core feature for a search engine and in YacY it barely works. To my understanding, detection is currently based on date format recognition and <head> tag. We both already know this is fuzzy at best.
For as much as it's simple, the algorithm I implemented in JS for my TTS software works very well for all the six supported languages. I also provided you with a demonstrative package ready to download and run, so to rapidly taste its effectiveness with the bundled demo. I also know you have experience with JS so you can understand the code.
I submitted many patches to other FOSS projects in the past; its unfortunate Java is not in my cultural baggage yet. Can you at least tell me where in the priority list is a reimplementation of the language detector?
Orbiter hat geschrieben:the language detection in YaCy was always fuzzy. Just recently I made experiments with language detection based on bayes filters in the loklak project. This works in many cases, but fails also quite often. This is just really a complex thing. I will try to add the loklak method to YaCy maybe, I already added the bayes classes but they will be used for something else first.
biolizard89 hat geschrieben:I think the concern here is that davide has offered to assist, and his offer has, as far as I can tell from this thread, been met with silence. @Orbiter, is YaCy willing to look at davide's code?
Orbiter hat geschrieben:biolizard89 hat geschrieben:I think the concern here is that davide has offered to assist, and his offer has, as far as I can tell from this thread, been met with silence. @Orbiter, is YaCy willing to look at davide's code?
You recommended davids code: did YOU actually test it?
davide hat geschrieben:I created a JS fIddle with my language detector.
Now you can test it directly on the browser:
biolizard89 hat geschrieben:Hi,
Any idea how well your code performs in terms of accuracy compared to the two methods supported by Solr?
Mitglieder in diesem Forum: 0 Mitglieder und 2 Gäste