Indexing of folders (file repositories) take ages and times out before complete

SHLelieveld · January 17, 2019, 9:26am

Hello community,

We have a Pydio 8.2.2 instance running on CentOS 7.6, and have an indexing issue.
When we want to open a certain file repository with a lot of files & folders, it frequently timesout the connection before complete (or it is so stuck that it cannot complete). or in some cases just shows the first 1000 files or so and no more. Even if there are >1000 files/folders in it.
I think the indexing is done by the clients, and not the server it self…? Or is this done by the server, but as the repo’s change via a different path, and not through Pydio, it keeps on restarting the indexing…

I believe indexing and searching is not functioning properly if ‘odd’ characters are used like & ^ % $ @ [ ] etc. or should that not make a difference…?

We use the Pydio system as a remote File Management tool, not as cloud storage for users.
We use it to move files around from 1 repository to another.
The file repositories are for example, our FTP server and an Archive Server.
Files deliverred to the FTP are moved from the FTP repo to the Archive repo via a Pydio command.
this sometimes involves files and folders of over 500GB in size at the time.

We already tweaked the php.ini file so that commands can be bigger in size, and execution time is set to a full day.

Are there some more tweaks we can do to get the pydio system to operate faster and complete indexing the repo’s?

Sorry for the many questions in 1 topic, but it is all related I think.

Thanks.

Stefan

c12simple · January 17, 2019, 2:46pm

Hi,

A ftp workspace with more than 500 GB, I recommended to use elasticsearch engine instead of lucence plugin.
Pydio works with elasticsearch 5 (not 6)

SHLelieveld · January 18, 2019, 9:26am

Most Repo’s are normal SMB mounted shares. But those shares are also fed by an FTP server. the Pydio is a seperate server with a finger on the backend storage of the FTP basically, we do have 1 FTP repo, but that is 99% of the time 1 of the destinations to drop data, not to search or source from.

I’ll have a look into elastic search 5, never worked with that… but I’ll read into it.

Thanks,

Topic		Replies	Views
Help with manual indexing speed? Pydio 8 linux	1	743	November 4, 2019
Is it possible to use elasticsearch Pydio 8	2	464	December 19, 2018
Filename Search not working Pydio 8	12	3125	April 9, 2019
Can't see last error popup Pydio 8 linux , apache , php71	2	414	August 11, 2020
Pydio segfault on samba share with spaces Pydio 8 samba	21	2091	August 8, 2020

Indexing of folders (file repositories) take ages and times out before complete

Related topics