Presentation is loading. Please wait.

Presentation is loading. Please wait.

Alfresco Scalability Benchmarking Before telling how cool Alfresco is, you better prove it!

Similar presentations


Presentation on theme: "Alfresco Scalability Benchmarking Before telling how cool Alfresco is, you better prove it!"— Presentation transcript:

1 Alfresco Scalability Benchmarking Before telling how cool Alfresco is, you better prove it!

2 Few items we’ll talk about… Scalability and benchmarking in the ECM context Alfresco 4 is rocket scalable. And we got proofs If your software scales, your BM framework must scale more!

3 Scalability and benchmarking What do you know about it?

4 One size fits all…

5 How scalable is your ECM? “ECM systems can be scalable or they can fail to scale well. They can have modular architectures that allow you to simply add more elements as required, rather than multiply the entire system as things expand. They can be scalable in that they have built in high availability, automatic failover support, run on enterprise grade application servers and databases. They can be scalable because they have been tested and proven to handle very high volumes (hundreds of millions of documents) in the repository and/or tested and proven to handle very high throughput rates (tens of thousands per hour or minute). There are many ways in which an ECM system can scale or not. But the biggest element determining whether the system can scale is your usage of it” http://www.realstorygroup.com/Blog/1403-Scalable-ECM

6 Alfresco ECM Solutions

7 Factors at Alfresco Scalability Just so we talk the same language:

8 What you need to know about Alfresco 4? Key scalability changes:

9 The game changer(s) – Node Creation Alfresco 3.x

10 Game changer(s) – Node Creation Alfresco 4.x NOTE: Solr will index with a default period of 15s

11 Game changer(s) – Node Search Alfresco 3.x

12 The game changer(s) – Node Search Alfresco 4.x NOTE: Solr needs to track ACLs!

13

14 Alfresco 4 Scalability Benchmarks The journey to the ultimate knowledge of Alfresco

15 Why benchmarking?

16 Verba volant, scripta manent Alfresco Product Validation of Alfresco 4 scalability Product bottlenecks Alfresco Field Sizing Achieve even more stellar use-cases Customers & partners Provide reference milestones Allow contextualized benchmarks

17 Benchmark projects overview BM-0001 – Alfresco 4 Scalability – 4.0.0 BM-0002 - 3.4 vs. 4.0 (Share Scenario - 360 u – 1 node)– 3.4.8 4.0.0 BM-0003 - SOLR vs. Lucene (Share Scenario) – 4.0.0 BM-0004 - Uncover repository limit using simple Site-based data – 4.0.2 BM-0005 - Measure Alfresco Cloud signup rates up to 125K users - Cloud BM-0006 - Measure Activiti workflow service performance – 4.0.2 BM-0007 - Measure Alfresco workflow API performance – 4.0.2 BM-0008 - Simulate 125K users on Alfresco Cloud – Cloud BM-0009 - Define optimal tuning and extrapolate sizing information for large scale Share Enterprise deployment - 4.1.1.x

18 BM-0001 and BM-0009 Common factors Benchmark Lab Enterprise Collaboration Scenario Technologies (Alfresco Enterprise + Jmeter) Differences Load testing scripts Database tier Repository content Both provided useful insight!

19 The benchmark lab (HW)

20 BM-0001 – Alfresco 4 Scalability benchmark You can never forget your first time! Objective Statistic analysis of Alfresco Scalability Best effort pre-tuning Scenario Search intensive Collaboration scenario 10s think time Implemented with Jmeter Async requests Memory intensive

21 BM-0001 – Scenario http://svn.alfresco.com/repos/alfresco-open-mirror/benchmark/scripts/SHARE/share-0001/V4.0.0/

22 BM-0001 – Scalability data points

23 BM-0001 – Architecture

24 BM-0001 - Software

25 BM-0001 – Scalability results

26 In other words…

27 BM-0001 take-aways 1100 concurrent users on 10M docs! With high search %, load is mostly on Solr Share is lightweight, repo not loaded Solr can be memory intensive Make sure you give enough memory! Scale out when needed! Dedicated Alfresco for Solr tracking beneficial

28 The “Alfresco 4 Scalability blueprint” Broad intro to Alfresco 4 Scalability Scalability analysis of BM-0001 results Architectural options Tuning and configuration details Load analysis Sizing and performance reference Available for Enterprise Customers & Partners at http://support.alfresco.comhttp://support.alfresco.com

29 BM-0009 – Real life collaboration Errare humanum est, perseverare diabolicum! Objective Create a real repository and scenario 2*Alfresco + 2*Solr Find optimal tuning / sizing for large concurrency Scenario Much less search intensive than BM-0001 15s think time Implemented with Jmeter More Async requests Less Memory intensive

30 BM-0009 – Scenario https://svn.alfresco.com/repos/alfresco-enterprise/benchmark/scripts/SHARE/share-0002/4.0.2/

31 BM-0009 – Repository details Share Sites: 10 000 Avg members per site: 100 + 3 random groups #files per site: 10 2MB docs + 1k docs of 1kB 3 level deep folder structure, each folder of the hierarchy contains 5 documents and 5 folders Users in repo: 50 000 Groups: 30 000 hierarchical (depth=7) Storage used: 1TB Number of nodes: >10M Number of assocs: >10M child assocs

32 BM-0009 – Architecture

33 BM-0009 - Software

34 BM-0009 – Scalability results

35 Once again…

36 BM-0009 take-aways Did I already say Alfresco ROCKS? Even on a high end realistic scenario, avg time 1.2s with 500 concurrent users (with 2*Alfresco and 2*Solr) Dedicated Alfresco tracking beneficial mostly for operational purposes (similar performance) Adding Solr nodes allows further degrees of scalability

37 Lessons learned (tuning tips) As we are not in marketing…numbers are not enough!

38 Make sure house is clean! Allow enough Open files Allow enough connections to your DB Allow enough Tomcat threads (but don’t exaggerate!) Disable your anti-virus! Check your balancer, and first test without it!

39 Areas you might want to tune Alfresco Repository db.pool.* (and DB connections accordingly) (only if your know what you’re doing) L2-caches and transactional caches solr.maxConnections Solr Solr caches in solrcore.properties #tracking threads (default=3). I/O bound so don’t exaggerate! alfresco.maxConnections mergeFactor in solrconfig.xml

40 Dedicated vs. shared tracking

41

42 Benchmark gotchas Jmeter is very memory intensive 30G to scale to 1100 users! Not fit for cloud-scale Jmeter does not parse Javascript Complex hacked implementation to mimic Share Full emulation is impossible! We need a scalable, distributed, flexible framework to run cloud-scale benchmarks!

43 Alfresco Benchmark framework I.e. cool people doing even cooler stuff to prove they are cool! Derek Hulley, Repository Lead Engineer

44 Gabriele Columbro Principal Architect Consulting Services @mindthegabz Derek Hulley Founding Engineer and Repository Team Lead


Download ppt "Alfresco Scalability Benchmarking Before telling how cool Alfresco is, you better prove it!"

Similar presentations


Ads by Google