victor-gartvich.com
Building Technical Operations: Metrics, metrics, metrics...
http://www.victor-gartvich.com/2013/06/metrics-metrics-metrics.html
Building and running a large production system is difficult. This blog is for people who build and manage large Internet and intranet systems and services. Monday, June 17, 2013. Metrics, metrics, metrics. A lot has been said about the importance of system and application metrics. I won't repeat it here and will concentrate on the off-the-shelf options available to implement a robust, usable, and scalable metrics collection and monitoring system. We will talk about three main areas. There are several Graphite...
victor-gartvich.com
Building Technical Operations: Operations requirements for in-house R&D products
http://www.victor-gartvich.com/2011/08/operations-requirements-for-in-house-r.html
Monday, August 29, 2011. Operations requirements for in-house R&D products. If you are faced with in-house R&D products, this post will help you define specific requirements from Operations to R&D for all in-house software provided for production deployment. Packaging and package names. Easy deployment in production and QA environments. There should be ...
victor-gartvich.com
Building Technical Operations: Remote Control for Your Production Site
http://www.victor-gartvich.com/2011/10/remote-control-for-your-production-site.html
Friday, October 14, 2011. Remote Control for Your Production Site. Have you ever found yourself rushing to the office data center in the middle of the night just because a critical server is down, and you don't have any means to remotely access the server's console or reset the power? How much time and money have you spent on line with. Graphical LOM ...
victor-gartvich.com
Building Technical Operations: My list of favorite Nagios check scripts
http://www.victor-gartvich.com/2011/09/my-list-of-favorite-nagios-check.html
Tuesday, September 13, 2011. My list of favorite Nagios check scripts. http://exchange.nagios.org. https://www.monitoringexchange.org/. The goal of the post is to share my list of favorite Nagios check scripts, so the next time you need to deploy a Nagios instance you can use this page as a reference for the required check modules. MySQL service mon...
victor-gartvich.com
Building Technical Operations: How to start?
http://www.victor-gartvich.com/2011/08/how-to-start.html
Tuesday, August 30, 2011. Some people ask me how to start building a production system and make sure that it will be reliable, scalable, and manageable when it outgrows two cabinets in one location (my working definition of a small system). Select the OS platform and distribution to go with (if your R&D leaves you a choice). Define the locati...
victor-gartvich.com
Building Technical Operations: Knowledge and information management
http://www.victor-gartvich.com/2011/06/knowledge-and-information-management.html
Thursday, June 23, 2011. Knowledge and information management. Equipment inventory details will be stored only in your Purchase Orders, most likely kept in your mailbox. Low-level design details can be found only in emails asking to perform cable wiring, or will be lost altogether. For a list and comparison table of available Wiki software. From open ...
victor-gartvich.com
Building Technical Operations: Vendor Management Tips
http://www.victor-gartvich.com/2013/12/vendor-management-tips.html
Sunday, December 1, 2013. This post is for technical operations folks managing vendor contracts. For technical people, dealing with vendors and legal paperwork can be a bit complicated, and I hope the information in this post will come in handy. Price negotiations and service testing. Management of vendor contracts. Recommended ways to approach a...
victor-gartvich.com
Building Technical Operations: Equipment naming convention
http://www.victor-gartvich.com/2011/09/equipment-naming-convention.html
Sunday, September 4, 2011. Why is it so important? Having a clear and meaningful equipment naming convention will help you to: Minimize human errors such as executing the right commands on the wrong servers. Spend less time analyzing monitoring alerts and log messages. Have fewer problems while automating system management tasks. To code inside a host name as much...
victor-gartvich.com
Building Technical Operations: What to monitor on a Linux box
http://www.victor-gartvich.com/2013/06/what-to-monitor-on-linux-box.html
Monday, June 17, 2013. What to monitor on a Linux box. Relevant articles from the blog: Metrics, metrics, metrics. My list of favorite Nagios check scripts. System metrics monitored on a Linux box using Nagios and SNMP-based plugins: ICMP reachability (packet loss and delay). RAM and swap memory usage. Disk space usage on all local partitions. Cisco ...