2015-03-16 TCO Calculator discussion

Date

16 Mar 2015

Preliminary tool has been designed and distributed by PSNC.

The initial aim was to calculate the minimum volume of a local storage s= olution for PSNC.

The numbers in the tool are dummy numbers.

Short review of the existing material was made including:

SNIA cost calculator =E2=80=93 models too simplistic, failed to simula= te e.g. Ceph based cluster performance
Wmarow's IOPS calculator =E2=80=93 might be worth having a look at as = includes some modelling

Here is a summary of the discussions around the tool

to split =E2=80=9Eareas=E2=80=9D into multiple sheets =E2=80=93 so tha= t we can distribute the work
good idea would be to have a sheet on example / reference server confi= gs and disk parameters
network uplinks =E2=80=93 include 1Gbit for management in rack space /= ports budget =E2=80=93 see network part
cooling efficiency factor =E2=80=93 might be included within power pri= ce =E2=80=93 see electricity
collecting failure rates would be good (again might be perceived as se= nsitive)

Power for the disks must be separated from the power for the main boar= d and other server components: memory, network interfaces etc.
Different storage architectures need some historical data on power con= sumptions to be analysed.
Various may need power consumption modelling. Work on the models as op= tions to select.
GRNET has some data collected =E2=80=93 Panos will check if they can e= xplored and will share conclusions/data
CSC has some data / models =E2=80=93 options what can be shared will b= e checked.
PSNC will be able to provide the data it is planning to collect.
We need long-term averaged measurements not the point-in-time data as = the power consumption varies depending on system activity.
We predict the real-life data to be more usable than models however mo= delling should still be explored.

Data Centre operations costs (using the existing facility) included Bu= ilding maintenance, Security, UPS, etc. can be virtualized and factored in.=
The same model can be applied for rented spaces (co-location) =E2=80= =93 we should enable calculating in RUs
Cooling can be part of these cost or part of the electricity calculati= on separate (for now it is part of electricity cost =E2=80=93 but the impre= ssion is that this should be analysed more explicitely in the model in orde= r to enable more detailed analysis / using various parameters)

Cost of the switches is included (10 Gbit ports to servers, pair of To= R switches with uplink =E2=80=93 see below)
Uplink component is missing for. FibreChannel could also be considered= as alternative technology.
At least 10G ports for uplink per switch and network for management (a= ccess to IPMI interfaces) must be considered. Some cases 40G ports.
Should the network be redundant? It depends on the size of the setup a= nd the other redundancy features of the architecture. Network redundancy ca= n be checked against the desired availability figures. (We should be carefu= ll as saving money on switches may enforce high data replication factor =E2= =80=93 costs...)

There are some Java tolls available. Panos, GRNET will play with them = and share experiences.
Calculations can be included in a separate sheet.

Electricity, Cooling factor, Staff expenses, and other costs (i.e. OPE= X above) are the main cost components we have to calculate with.
The number of racks used should be an open parameter in the model. Dif= ferent disk/server/rack density.
Consider not only full racks but also fractions of it (couple of RUs).= In some cases this level of granularity is needed.
We could come up with 4-5 different server configurations as different= calculation models to use. Footnotes are needed to explain the differences= and the reasoning behind the models.
Create separate XLS sheets per cost component, and re-distribute the c= alculator.
Share numbers on power consumption, if possible.
Using Goolge doc with dummy numbers on sensitive data (staffing, etc.)= is fine for now. Some XLS Marcos may need to be developed off-line. Let's = decide how to share the final product later.

Document editing:

Maciej, PSNC to share the updated version of the TCO calculator (split= into areas such as servers, energy, staff etc) and start collecting commen= ts/notes in a separate document, to be developed as a "Cost Effective Stora= ge how-to"
ALL, to connect on the TCO tools and start working on the various shee= ts.

IOPS modelling:

Panos, GRNET to play with the IOPS tools and let us know the experienc= es.
others to review / look for other sources of IOPS/bandwidth, power con= sumption models

Power usage (important part of cost):