Indexing local file:/// without going upwards

Discussion in English language.
You can start and continue with posts in english language in all other forums as well, but if you are looking for a forum to start a discussion in english, this is the right choice.

Indexing local file:/// without going upwards

Beitragvon data2016 » So Feb 21, 2016 1:51 pm


how can I prevent a crawl from going upwards in the file hierachy given a path to start with via file:///home/user/data?
The thing that seems happening is that (maybe due to symlinks?) the crawler at some point starts index even root-folders of a linux system , so it clearly goes upwards and crawls folders like "/var /usr /boot", etc.

Crawler filter was already set to "Restrict to subpaths", any idea how to make sure Yacy only goes downwards, but never upwards in filehierachy?
Beiträge: 4
Registriert: Fr Jan 01, 2016 8:12 am

Re: Indexing local file:/// without going upwards

Beitragvon Orbiter » Fr Feb 26, 2016 11:22 am

maybe the 'restrict to subpath' does not work correctly because it expects a domain which is not there with file paths.
It should work when 'use filter' is set to a self-defined filter, like "file:///home/user/data.*" (I did not try that yet.. but if that does not work it is definitely a bug)
Beiträge: 5798
Registriert: Di Jun 26, 2007 10:58 pm
Wohnort: Frankfurt am Main

Zurück zu English

Wer ist online?

Mitglieder in diesem Forum: Bing [Bot] und 2 Gäste