I have the same problem, I specifically need to know the hardware requirements for different volumesbased on the following 3 scenarios :
- 5 courses with 200 students each
- 1 college/school with 100 courses and 50 000 students
- Everyone : 6000 modules with 400 000 students
Please help... please
]]>Sorry for the delay in replying. At the moment we have had to suspended the implementation because of other projects which have become more critical. I'm hoping that we will start again with the Mahara implementation in the coming months. I will post more details when it gets going again.
]]>Would I recommend SSL? Well, with SSL you balance security with performance, so it's really up to you. There is a patch floating around for doing just SSL on the login page (though it doesn't work for the password reset page yet).
As for open ports - Mahara needs to be able to make outbound connections to port 80 on other machines, and also to 443 if you want https RSS feeds to work. You'll also need to configure the web servers to be able to send e-mail so the cronjobs can mail out forum post updates. This may or may not require opening ports, it depends on how you set up mail. Mahara supports SMTP if you need that.
]]>I'm afraid I have no idea what Hive is. We've previously used nfs to mount a share off a file server (no SANs where I am), with success.
A lot of our other systems all run SSL - Would you recommend SSL? if so what ports would I need and also what type of certificates would Mahara need?
Really appreciate the help so far.
Regards
Neil
]]>Another question - with regards disk storage and file based objects. Is everything stored in the database, or can you upload and store file objects? if you have file objects can these be served via Hive or is it better to have it as a simple shared file system on the SAN?
Thanks
Neil
]]>Other than that, it's all pretty simple. The code is the same on all the servers of course, as is the apache configuration.
]]>Glad to hear you haven't had problems with RedHat based OS. I'm a bit restricted in some areas as our stuff needs to sit in an Enterprise world and needs mainstream support - hence RedHat.
I'm going to have to see about the PostgreSQL vs MySql - support and maintenance are going to be my biggest issue. But I don't think I have much of a choice given the volumes we are talking about.
I'd appreciate some feedback on how you get on with your cluster work. Likewise happy to help if you get into problems.
Cheers
Neil
]]>Thanks for the help - I'll base my calculations around our Moodle which we already have in place. Disk space isn't too much of an issue - our Zimbra/Hive currently has 20Tb allocated.
Could you give me a bit more info around what you did in the deployments - was this done using Hardware network balancing, or did you use Tomcat/Apache load balancing?
Really appreciate the offer of help if we need it.
This is a bit of a critical project as it is for the entire school estate of one of the largest metropolitan councils in Europe. Our Shibboleth alone is being built to support ~1,000,000 users covering students, parents, teachers and government bodies. We are using Mahara as our official recommendation as the ePortfolio system for this deployment.
I'll keep everyone upto date with how we get on. At the moment we have Moodle deployed, Zimbra being deployed - just sorting out a issue with the proxy, Hive already deployed and linked to Moodle, and our Identity Management solution will have a first drop in July. We are working on shibboleth components for all of these. Might also be worth mentioning that we are using SIF (linked into a multi-node ZIS) for all our identity and person details. This is all linked into schools MIS systems so we have direct provisioning of identity from the schools - and the IdM + SIF will be driving the desktop provisioning of AD based services.
Keep ya posted.
Regards
Neil
]]>If it's any help - we have been running Mahara on CentOS 5.x (which is basically identical to RHEL5) for quite a while (since version 0.8.x) and there are no issues with this whatsoever. CentOS & RHEL have been our preferred production hosting platform for years now.
From our experience the choice of DB is probably more of an issue than the OS Distro. Mahara is better tested on PostgreSQL than on MySql - but having said that - we have been running all of ours on MySQL.
As for VMWare - no issues there - most of our installations are on VMWare Blade Servers.
In terms of clustering - we are looking at the same issue at the moment - so I can't give practical feedback. However - I can not foresee any issues with this as we have run Moodle in clustered environs for a while.
HTH,
Leo
Mahara requires a pretty simple LAMP-ish based stack, and therefore can scale just like any other product on such a stack. We've deployed Mahara on clusters before, and it works just fine.
I would suggest that any calculations you've done for Moodle will reflect reasonably well for Mahara, as they're basically the same stack. If anything, Mahara will perform better as it does less per page (we've made sure that, for example, no writes are done on the average page load).
It's likely people will use more file space in Mahara than in Moodle - but even if you gave all your users a 1G quota, it's highly unlikely most of them will use anywhere near that amount, which means you can get away with greatly overselling it in the short term, and tracking the usage over time.
Can you use all of those technologies? Yes, as they'll all handle LAMP stuff. As for the stack itself, I would strongly recommend you use PostgreSQL as your database if at all possible.
I guess the only other question is whether Mahara can handle the load you're talking about. As I said before, Mahara does less per page than Moodle so I believe it will scale better, and we've certainly designed it with scalability in mind (not having a complicated roles system really helps here), but we haven't deeply investigated the performance yet, so you may come across one or two foulups/slow queries on such a large deployment. We'll be happy to help you if you do!
]]>Thanks for the response. With regards the environment I'd like your opinion on the following please - we are looking at using Mahara in a fairly large enterprise environment for education. The architecture for the overall solution has a full Shibboleth Identity Management system for ~1,000,000 users, Moodel for ~100,000, Zimbra email for ~100,000 users and Hive. I'm looking at plugging Mahara into this infrastructure using Shibboleth, Moodle and Hive. Our systems are built on latest IBM blades using VMWare technology - so I would be looking at using the same for Maraha based on RedHat which is our chosen Linux OS.
In addition the system has to provide 99.9% availability, so I would need to implement it such that it can be scaled in a cluster environment.
So a couple of key questions out of this:
From a user basis we are looking at a year on year uptake starting with ~60,000 users, with a similar increase year on year, with a target user base in 3 years of ~180,000-200,000 users acvtively using our systems. We have based most of our current system calculations on a concurrency model of Yr 1=5%, Yr2=10% and Yr3=33%.
Any advice on the above would be much appreciated.
Regards
Neil
]]>As to how much resources you'll actually need, this depends on the number of users - both in total, and the maximum number of concurrent users - you're expecting. Do you have any ideas on how many users you'll have?
I am looking at architecting a solution implement Mahara in a large scale environment and I cannot find much on the server requirements - i.e. processor/memory/disk/io.
Is there any documentation which can help around this - basically I have no hardware and I need to know what to purchase and data to put together sizing volumetrics to include in a support model.
Can anyone help?
Thanks
Neil
]]>