I have heard that Sharepoint could be the answer to a loosely managed, fast growing file share. I was asked to try and find a solution for a file share which currently houses millions of files, and terabytes of data using Sharepoint. I see a number of huge challenges with this exercise if we use Sharepoint, and causing many questions.Â
Should we import the documents into the database, should we use RBS, should the files stay on the file share and just import metadata or references? How would that happen? What type of storage, how would you set up the content databases? Would you use multiple drives? How will this effect backups? Also what is the best way to migrate 10s of millions of files?Â
We would probably want to eliminate duplicates, keep versions, and store meta-data for easy searching.
Does anyone have any experience, ideas or suggestions regarding this type of process being managed in Sharepoint? Are there any third party applications that make sense? Has anyone actually done something like this before.
Thanks in advance for your help.Â
Vikki McCormick
In my experience I have found that it is not always suitable for everything to live within SharePoint. You will more than likely need to use a mixture of File Share and SharePoint. Also with things like versioning if you were to move lots of data into a SharePoint 2010 environment you could soon find your space requirements rapidly increasing as each change to a file is saved as another copy of the file. This of course is not such a big issue for SharePoint 2013 which utilises Shredded Sotrage, where only the deltas are saved.
I would also test the various file types that you intend to move into SharePoint, just to ensure they work as expected. Your office documents will no doubt be fine and require little if no thought at all but other file types will need to be checked prior to migration. An example would be access database files. I had one instance where a customer moved everthing into SharePoint and had to move the Access databases back into the file share and have only the forms that relied on the databases stored in SharePoint. Again things are improving, so you do now get the chance to create Access database based applications with SharePoint in mind, so not so much of an issue.
With the mention of RBS storage, this could be a good candidate if you have a lot of media content, but does make for a slightly more complex recovery strategy and can leave you with limitations such as no DB Mirroring.
