Contents [hide]
Monitor the health of your application infrastructure with Elasticsearch & Kibana

Elasticsearch is an open-source, distributed search and analytics engine that is commonly used for log analytics, full-text search, and operational intelligence.โฏKibanaโฏis a free open-source data visualization tool that provides a tight integration with Elasticsearch and is the default choice for visualizing data stored in the latter.
How They Work Together
Data is sent to Elasticsearch in the form of JSON files via the Elasticsearch API or other ingestion tools such as Logstash or Amazon Kinesis Firehose. Elasticsearch then proceeds to store the document and adds a searchable reference to the document in the clusterโs index which can be retrieved using the Elasticsearch API. This data stored in Elasticsearch can be used to easily set up dashboards and reports with Kibana to gain access to analytics and additional operational insights.
โThe ability to make sense out of data is no longer simply a competitive advantage for enterprises, it has become an absolute necessity for any company in an increasingly complex and statistics-driven world. The visualizations provided by Kibana on Elasticsearch data can quickly provide deep business insight.โ
โ Brad Winett, TrackIt President
Helping ElephantDrive Take Advantage of Kibana Dashboards to Better Monitor their APIs
ElephantDriveโฏis a leading service supplier that provides individuals and businesses simple but powerful tools for protecting and accessing their data. With ElephantDrive, ordinary people enjoy the peace of mind that comes from the type of enterprise-class backup, storage, and data management that has historically only been available to big corporations.
ElephantDrive wanted to improve its ability to store, analyze, and visualize log information, so they set up a basic ELKโฏ(Elasticsearch, Logstash, Kibana)โฏstack. The initial Kibana implementation was in place but without any of the dashboards that make it such a valuable tool, so ElephantDrive approached the TrackIt team and asked us to analyze ElephantDrivesโs current Elasticsearch logs to recommend dashboards that could be set up to allow for better log monitoring. Two were created for this specific purpose:
- A โdata.operationโ dashboard that displays the distribution of requests by operation in a pie chart
- A โdata.apiKeyโ dashboard that displays the average response time per API key
โWe were able to get the basic stack up quickly, but wanted to turn the data into actionable information โ the Track It team not only helped us leverage the power of Kibanaโs visualizations, but also provided the education, documentation, and tools for us to take the next steps on our ownโ
โ Michael Fisher, ElephantDrive CEO and Co-Founder
The following is a thorough tutorial that will first walk the reader through the general process of setting up dashboards using Elasticsearch and Kibana before illustrating the steps we took to set up these two dashboards for ElephantDrive.
Accessing Elasticsearch & Kibana
Communication with Elasticsearch is done via HTTP requests. We have used Postman in this example, which provides us with a more graphical interface to make requests. To access Elasticsearch, you can make requests in the following way using a curl in a shell script:
curl -v โhttp://ec2-XXX-XX-X-XX.compute-1.amazonaws.com:9200/_cat/indices?vโ
To accessโฏKibana, load this URL in your browser :
http://ec2-XXX-XX-X-XX.compute-1.amazonaws.com:5601
Logstash Ingestion Issue & How To Fix It
ElephantDrive had an issue with their Logstash. Under some rare circumstances, the Logstash ingestion was failing and the following error message was received:
[2020โ03โ04T22:34:52,349][
WARN
][logstash.outputs.elasticsearch]
Could
not index event to
Elasticsearch
. {:status=>400, :action=>[โindexโ, {:_id=>nil, :_index=>โlogstash-2020.03.04", :_type=>โdocโ, :_routing=>nil}, #<
LogStash
::
Event
:0x16a5ee83>], :response=>{โindexโ=>{โ_indexโ=>โlogstash-2020.03.04", โ_typeโ=>โdocโ, โ_idโ=>โAXCnr_f9Ski653_WeeEoโ, โstatusโ=>400, โerrorโ=>{โtypeโ=>โmapper_parsing_exceptionโ, โreasonโ=>โfailed to parse [data]โ, โcaused_byโ=>{โtypeโ=>โillegal_state_exceptionโ, โreasonโ=>โCanโt get text on a START_OBJECT at 1:171"}}}}}
This error was thought to be coming from a malformed log entry arriving at the exact moment a new Elasticsearch index is created. This would happen if the malformed log entry is the first one sent to Logstash on a new day since Logstash creates a new index each day.
Since the Elasticsearch mapping is dynamically created from the message parsed by Logstash, a malformed message will put a wrong mapping in the index, which will, in turn, stop the correct message from being ingested.
Fixing the Logstash Ingestion Issue
If you are facing a similar issue, the first step to take is to shut down Logstash. Once Logstash is shut down, you need to delete the incriminated index. The index name can be found in the Logstash log (and is typically โlogstash-YYYY.MM.DDโ).
Open Kibana, and go to โDev Toolsโ:

Delete the index with the following command (using the name of the incriminated index) :

You can then restart Logstash and let it ingest new log entries.
Once Logstash has recreated the index, you will also need to refresh the field list to get the correct mapping (under Management > Index Pattern > logstash-*) :

Getting Started With Kibana
How To Create A Dashboard with Kibana โ Pie Chart Example
In this initial example, we will walk you through the process of creating a pie chart dashboard that shows the most performed queries.
- Go to theโฏโVisualizeโโฏtab and click theโฏโCreate new visualizationโโฏbutton

- SelectโฏโCompare parts of a wholeโ (Pie chart)

- The data required to create this dashboard is located in theโฏโlogstashโโฏdatabase.

- SelectโฏโSplit Slicesโ

- UnderโฏโAggregationโ, chooseโฏโTermsโ

- Selectโฏโdata.operation.keywordโ

- Choose how you would like to order the dataand alsothe number of slices withโฏโSizeโ

- Click onโฏโApply changesโ

This is what the result looks like:

Creating the โAverage response time per API Keyโ Dashboard
The first dashboard we created for ElephantDrive is a data table that displays the average response time per API key. The steps to implement this dashboard are as follows:
- Go to theโฏโVisualizeโโฏtab and click theโฏโCreate new visualizationโโฏbutton

- For this type of data, choosingโฏโData Tableโโฏis a relevant choice.

- The data required to create this dashboardis located intheโฏโlogstashโโฏdatabase.

- Now we need to add a row for the API keys, so underโฏโselect buckets typeโ, chooseโฏโSplit Rowsโ

- To find API keys, chooseโฏTermsโฏunderโฏโAggregationโโฏand chooseโฏdata.apiKey.keywordโฏunderโฏโFieldโ

- To add the average response time per API key to the data table, click on โAdd metricsโโฏunderโฏโmetricsโ

- Under โAggregationโ chooseโฏAverage, and underโฏโFieldโโฏchooseโฏdata.response.totalTime

- Click onโฏโApply changesโ.โฏWe now have a dashboard that shows us the average response time per API key:

Creating the โAverage time per operationโ Visualization
The second dashboard we created for ElephantDrive displays the average time elapsed (in ms) by operation type. The steps to implement this dashboard are as follows:
- Go to the โVisualizeโ tab and click the โCreate new visualizationโโฏbutton

- For this type of data, choosingโฏโVertical Barโโฏis a relevant choice.

- The data required to create this dashboardis located intheโฏโlogstashโโฏdatabase.

- We first want to see the average time, so in the โmetricsโ section choose โAverageโ inโฏAggregationโฏand โdata.response.totalTimeโ inโฏField.

- Then we want to add a metric

- Select โDote Sizeโ

- And select โCountโ asโฏAggregation, to be able to see how many times the query has been used.

- Now, to see which method is used (GET, POST, PUT, etc..) wehave togo in theโฏBucketโฏsection, and chooseโฏโSplit Seriesโ

- And then โTermsโ asโฏAggregation, โdata.request.method.keywordโ asโฏfield, andโฏOrder ByโฏโAverage timeโ (created step 4)

- Finally, to see where the operation has been made, we will add a sub-bucket

- And create a โTermsโโฏSub Aggregation, with โdata.operation.keywordโ asโฏField

There you have it!

And you can see more details by hovering over a section with your mouse

Better Visibility & Enhanced Productivity In Log Monitoring
Cloud based infrastructure can sometimes feel like a black box with only limited visibility into its efficiency. Utilizing Kibana and Elasticsearch provides ElephantDrive with informative dashboards that provide insight into their compute environment, enhancing the efficacy of their log monitoring efforts.
About TrackIt
TrackIt is an international AWS cloud consulting, systems integration, and software development firm headquartered in Marina del Rey, CA.
We have built our reputation on helping media companies architect and implement cost-effective, reliable, and scalable Media & Entertainment workflows in the cloud. These include streaming and on-demand video solutions, media asset management, and archiving, incorporating the latest AI technology to build bespoke media solutions tailored to customer requirements.
Cloud-native software development is at the foundation of what we do. We specialize in Application Modernization, Containerization, Infrastructure as Code and event-driven serverless architectures by leveraging the latest AWS services. Along with our Managed Services offerings which provide 24/7 cloud infrastructure maintenance and support, we are able to provide complete solutions for the media industry.