Jul 22, 2015

EMC - iSilon - INSIGHTIQ Overview



EMC Isilon allows you to easily manage petabytes of storage. With InsightIQ, you can go a step beyond, with advanced monitoring and reporting tools to identify performance bottlenecks in your workflows and minimize the amount of high-performance storage required in your environment. InsightIQ provides you with advanced, near-realtime and historical, analytics to optimize applications, correlate workflow and network events, as well as deep views to the data being stored on the cluster. To optimize capacity usage, InsightIQ provides reports with a clear breakdown of capacity usage, quotas usages over time and deduplication effectiveness


1. This section shows the Aggregate view of data from all clusters
  •  Capacity is shown
  •  Clusters currently being monitored
  •  Combined health of all clusters
  •  Current active statistics for all clusters
2. These are recent trend charts for the last 12 hours, sometimes known as sparklines


Scroll down on the dashboard and highlight the following:
1. Similar statistics are displayed for each cluster as is displayed at an aggregate level above.
2. Performance details for this specific cluster can be jumped to by clicking this link.
3. File system details for this specific cluster can be jumped to by clicking this link.
4. Estimated usable capacity will display on clusters OneFS 7.2 that have run an FSA job. Clicking the Capacity
Reporting link will provide details of the origin of the Usable Capacity number.
5. Clicking the + icon will expand the details for that cluster.
6. The order can be changed by clicking and dragging the handle on the right.



Showing Performance Reports
Select the performance report and cluster
1. Select the PERFORMANCE REPORTING Tab.
2. Select Cluster Performance from the "Select a report:" menu.
3. Select cribsbiox from the "Apply to cluster:" menu


Select the date range and view the report
1. In order to give the best demonstration in this vLab, it is best to select the date range of 10/21/2014 - 10/28/ 2014.
2. Click the View Report button.


Show the simplicity of the chart view
Highlight the following:
1. Any of the report views created here can be scheduled, automatically generating a PDF and emailing it.
2. The report can be customized allowing administrators to choose exactly which charts to include in the report.
3. The zoom level can be changed here as well as moving forward and backward in time.
4. Custom filters can be made here allowing administrators to quickly view the data they are interested in.
5. Any of the charts can be collapsed here.
6. Breakouts are a very powerful way to show some of the details of what is going on in the chart.
7. Any chart can be downloaded as a CSV by clicking this link. Administrators can also SSH to the InsightIQ server and dump large sections of performance data to CSV file. This can be automated with cron also.
8. The charts are all interactive and allow data detail to be shown by simply hovering over that part of the chart. Administrators can also click and drag to move the timeline, double-click to zoom in and double-click while holding the shift key to zoom out

Find the heaviest client
  • Click the Client link in the "Breakout by" section.
  • This makes it very clear that 10.7.189.141 is using most of the throughput during this time window.
  • Filter on the client by clicking on the IP address.
  • Note: if in production, the system will reverse look up address to display DNS names here


Understanding Capacity with InsightIQ
1. Click the FILE SYSTEM REPORTING tab.
2. Click the Capacity Reporting link.
3. Select dog-puddles from the "Apply to cluster:" menu.
4. Click the View Report button.


There is a lot of data here, but it does help administrators understand what is currently consumed for storage and how
the estimated value of more capacity is calculated. This is very useful, as the adaptive nature of OneFS is so different
from other storage systems, it can bring a lot of clarity to administrators. This page is laid out and calculated like a
simple subtraction equation to make it easy to follow. Note: all number are in base 2.
  • 1. "Total Capacity", this is the total raw capacity of the entire cluster, not including overhead.
  • 2. "Unallocated Capacity", in the context of Isilon OneFS, there should only be a number here with there is a newnode type add that is not yet 3 more nodes. So there is capacity on that new node, but data cannot be put onit until there are at least 3 of that node time.
  • 3. "Allocated Capacity", this is the capacity available for data to be stored on. This is usually the same and TotalCapacity.
  • 4. "Reserved for Virtual Hot Spares", this is reservation of space at a cluster level for drive failures to insure thereis enough capacity for a drive rebuilt to complete as OneFS does not have stand-by hot spares and uses allthe drives, all the time
  • 5. "Writeable Capacity", this is the capacity in which data can be written.
  • 6. "User Data including Protection", this is all the physical space consumed by the data currently stored on the
cluster, including the protection overhead to store it safely.
  • 7. "Snapshots Usage", this is the space consumed by all the snapshots on the cluster.
  • 8. "Remaining Capacity", this is the left over raw capacity that can still have data written to.
  • 9. "Estimated Additional Protection Overhead", this number is based on the latest File System Analytics job thatwas run. This tells InsightIQ how much data physically vs logically is being consumed, which is used to givean estimate for future data that will be added. This assumes that the data to be added will be similar to thedata currently stored. Not this number is very low because there are many thin provisioned files on thecluster. That means they are logically very large, but physically very small. This make the number small.However, if the data to be added is similar to what is storaged, this should not be an issue.
  • 10. This is drop-down that will list all the File System Analytics (FSA) jobs and gives the option to choose anestimation based on a report other than that latest.
  • 11. "Estimated XXX of Usable Capacity", this is estimation of how much more logical (what the end user would understand) data can be added to the cluster.


Useful Link

EMC Isilon InsightIQ Overview

Technical Demo: EMC Isilon InsightIQ

Isilon Student Guide Foundations
Useful Link









No comments:

Post a Comment