Difference between revisions of "Links Monitoring pages"
From GridPP Wiki
								
												
== Grid Monitoring ==
* [http://gstat-wlcg.cern.ch/apps/capacities/sites WLCG REBUS Capacities]
=== UKI Nagios/myegi/argo monitoring ===
* [http://argo.egi.eu/lavoisier/status_report-site?ngi=NGI_UK&report=Critical&accept=html ARGO UK sites]
=== Security monitoring ===
* [https://operations-portal.egi.eu/csiDashboard/ngiDetails/ngi/NGI_UK/view_type/monitoring/colspan/unknown EGI Security Dashboard UK view]
== Transfers Monitoring ==
* [https://lcgfts3.gridpp.rl.ac.uk:8449/fts3/ftsmon/#/ RAL FTS3 (production instance) monitoring web app]
* [http://ganglia.gridpp.rl.ac.uk/cgi-bin/ganglia-fts/fts3-sites.pl RAL FTS3 (production instance) ganglia plots]
* [https://fts3-test.gridpp.rl.ac.uk:8449/fts3/ftsmon/#/ RAL FTS3 (test instance) monitoring web app]
* [http://dashb-wlcg-transfers.cern.ch/ui/ WLCG transfers dashboard]
* [http://dashb-fts-transfers.cern.ch/ui/ FTS dashboard]
== Network Monitoring with perfSONAR ==
* [https://psmad.opensciencegrid.org/maddash-webui/index.cgi?dashboard=UK%20Mesh%20Config WLCG perfSONAR dashboard]
* [https://ps-dash.dev.ja.net/maddash-webui/index.cgi?dashboard=UK%20Mesh%20Config Jisc perfSONAR dashboard]
* [https://psetf.opensciencegrid.org/etf/check_mk/index.py?start_url=%2Fetf%2Fcheck_mk%2Fview.py%3Fhostgroup%3DUK%26opthost_group%3DUK%26view_name%3Dhostgroup WLCG/OSG perfSONAR Check_MK]
* [http://ps-dashboard.es.net/maddash-webui/index.cgi?dashboard=6%3A%20ESnet%20to%20International ESnet International perfSONAR dashboard]
== Accounting ==
* [https://twiki.cern.ch/twiki/bin/view/LCG/AccountingFAQ WLCG Accounting FAQ]
* [https://accounting-next.egi.eu/egi/country/United%20Kingdom New EGI accounting portal]
* [http://tinyurl.com/hevnfz5 Experiments APEL comparison UK]
* [http://tinyurl.com/zdtco8j ATLAS APEL comparison UK last 3 months]
* [http://tinyurl.com/zksowcl CMS APEL comparison UK last 3 months]
== Experiment Monitoring ==
=== ATLAS ===
* [http://adc-monitoring.cern.ch ADC Monitoring]
* '''Blacklist specific pages'''
** [https://bigpanda.cern.ch/sites/?cloud=UK BigPanda Queues status]
** [http://atlas-agis.cern.ch/agis/pandablacklisting/list Panda Queue Blacklist page]
** [http://atlas-agis.cern.ch/agis/ddmblacklisting/list/ DDM blacklist page]
'''Job Monitoring'''
*'''Historical'''
** [http://panglia.triumf.ca Panglia]
** [http://tinyurl.com/ounn5od Atlas Historical Dashboard UK view] (Soon obsolete)
** [https://monit-grafana.cern.ch/d/a62E4PgWk/job-accounting-uk-cloud?orgId=17 New grafana monitoring UK view]
*'''Big Panda Production'''
** [https://bigpanda.cern.ch/dash/production/?cloudview=region#cloud_UK Big Panda production dashboard]
*'''Big Panda Analysis'''
** [https://bigpanda.cern.ch/dash/analysis/#cloud_UK Big Panda Analysis dashboard]
** [http://hammercloud.cern.ch/hc/app/atlas HammerCloud tests (V4)]
*'''Pilot/Harvester worker'''
** [http://apfmon.lancs.ac.uk/ Pilot wrapper job monitoring]
** [https://tinyurl.com/y63jvapb Harvester monitoring]
'''AGIS'''
* [http://atlas-agis.cern.ch/agis/atlassite/table_view/ Site Configuration]
* [http://atlas-agis.cern.ch/agis/panda_queue/table_view/ Queue Configuration]
'''DDM and transfers'''
* [https://tinyurl.com/y2a2mjrn DDM grafana dashboard]
** [https://tinyurl.com/yynnsc8a UK as a source]
** [https://tinyurl.com/yy2arjqk UK as a destination]
'''Storage'''
* [http://tinyurl.com/kqqx2vd ATLAS UK storage accounting]
* [http://www.hep.lancs.ac.uk/~love/ukdata/ Peter Love's pledge monitoring]
* [http://adc-ddm-mon.cern.ch/ddmusr01/plots Rucio Space Tokens plots]
* [https://rucio-ui.cern.ch/bad_replicas?state=SUSPICIOUS List of suspicious files]
'''SUM and Lloyds'''
* [http://tinyurl.com/jhlczrh ATLAS ETF (nagios)]
* [http://tinyurl.com/nejr8r4 ATLAS ETF (dashboard)]
* [http://wlcg-squid-monitor.cern.ch/snmpstats/mrtgatlas2/indexatlas2siteUKI-NORTHGRID-MAN-HEP.html ATLAS Squid monitoring]
'''Other services'''
* [https://atlas-logbook.cern.ch/elog/ATLAS+Computer+Operations+Logbook/?Cloud=^UK%24 Atlas shifters elog]
'''UK Support Mailing Lists'''
* <span style="color:#0000ff">atlas-support-cloud-uk@NOSPAMcern.ch</span> Atlas UK Cloud Support (use for help solving problems and in GGUS tickets)
* <span style="color:#0000ff">atlas-uk-comp-operations@NOSPAMcern.ch</span> Atlas UK Computing Operations (use for general discussion)
* [https://indico.cern.ch/categoryDisplay.py?categId=4620 Atlas UK meeting (Thursday 10am)]
=== CMS ===
* [http://dashboard.cern.ch/cms Dashboard]
* [http://tinyurl.com/h467yn9 CMS ETF (nagios)]
* [http://tinyurl.com/kyzmghs CMS ETF (dashboard) T2]
* [http://wlcg-sam-cms.cern.ch/templates/ember/#/plot?group=Tier3s&profile=CMS_CRITICAL&sites=T3_UK_London_QMUL%2CT3_UK_London_RHUL%2CT3_UK_London_UCL%2CT3_UK_ScotGrid_GLA%2CT3_UK_SGrid_Oxford CMS ETF (dashboard) T3]
* [http://dashb-ssb.cern.ch/dashboard/request.py/siteview? Site status Board (T2 and T3)]. Click on 'Analysis' and 'Production' for further details.

* [https://cms-site-readiness.web.cern.ch/cms-site-readiness/SiteReadiness/HTML/SiteReadinessReport.html#T2_UK_London_Brunel Site Readiness status (T2s only; the link opens at Brunel, scroll down for other UK sites)]
* [http://cmsdoc.cern.ch/cms/aprom/phedex/prod/Activity::RatePlots?view=global PhEDEx data transfer rate plots]
** [http://cmsdoc.cern.ch/cms/aprom/phedex/prod/Activity::RatePlots?graph=quantity_rates&entity=src&src_filter=UK&dest_filter=&no_mss=true&period=l96h&upto= From UK sites]
** [http://cmsdoc.cern.ch/cms/aprom/phedex/prod/Activity::RatePlots?graph=quantity_rates&entity=dest&src_filter=&dest_filter=UK&no_mss=true&period=l96h&upto= To UK sites]
* Debug transfers from UK sites: [https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T2_UK_London_IC&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T2_UK_London_IC], [https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T2_UK_London_Brunel&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T2_UK_London_Brunel], [https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T2_UK_SGrid_RALPP&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T2_UK_SGrid_RALPP], [https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T2_UK_SGrid_Bristol&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T2_UK_SGrid_Bristol], [https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T3_UK_London_QMUL&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T3_UK_London_QMUL], [https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T3_UK_London_RHUL&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T3_UK_London_RHUL], [https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T3_UK_ScotGrid_GLA&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T3_UK_ScotGrid_GLA] (no debug transfers for Oxford)
=== LHCb ===
* [http://tinyurl.com/zcttfdf LHCb ETF (nagios)]
* [http://wlcg-sam-lhcb.cern.ch/templates/ember/#/historicalsmry/heatMap?group=Tier%200%2F1&profile=LHCb_CRITICAL&site=LCG.SARA.nl%2CLCG.RRCKI.ru%2CLCG.RAL.uk%2CLCG.PIC.es%2CLCG.NIKHEF.nl%2CLCG.IN2P3.fr%2CLCG.GRIDKA.de%2CLCG.CNAF.it%2CLCG.CERN.ch&time=Last%2024%20Hours LHCb ETF (dashboard)]
* [https://lhcb-portal-dirac.cern.ch/DIRAC/?view=tabs&theme=Grey&url_state=1|*DIRAC.SiteSummary.classes.SiteSummary:, Site Summary]
* [https://lhcb-portal-dirac.cern.ch/DIRAC/?view=tabs&theme=Grey&url_state=1|*LHCbDIRAC.Accounting.classes.Accounting:, LHCb Accounting] Choose what you want to look at in the first drop-down box, called "category". Hints: "Data operation" = Transfer operations, "Job" = Completed jobs, "WMS history" = Pilot jobs, "Pilot" = Pilot statuses.
* [http://pprc.qmul.ac.uk/~walker/votable.html Steve Lloyd User Monitoring (lhcb)]
=== ALICE ===
* [http://tinyurl.com/huwrqs3 ALICE ETF (nagios)]
* [http://wlcg-sam-alice.cern.ch/templates/ember/ ALICE ETF (dashboard)]
* [http://alimonitor.cern.ch/stats?page=SE/table SE tests]
* [http://alimonitor.cern.ch/siteinfo/?site=RAL Site Overview]
* [http://alimonitor.cern.ch/display?page=jobResUsageSum_time_cpu CPU Accounting]
* [http://alimonitor.cern.ch/display?page=FTD/SE RAW data replication speed]
== Tickets ==
| Line 153: | Line 131: | ||
* Argus
** https://www.gridpp.ac.uk/wiki/ARGUS_deployment
{{KeyDocs|responsible=Alessandra Forti|reviewdate=2015-11-15|accuratedate=2015-11-15|percentage=90}}
Latest revision as of 08:41, 7 August 2019
Grid Monitoring
UKI Nagios/myegi/argo monitoring
Security monitoring
- EGI Pakiti - only if you are registered as a security officer in GOCDB
 - EGI Security Dashboard UK view
 
Transfers Monitoring
- RAL FTS3 (production instance) monitoring web app
 - RAL FTS3 (production instance) ganglia plots
 - RAL FTS3 (test instance) monitoring web app
 - WLCG transfers dashboard
 - FTS dashboard
 
Network Monitoring with perfSONAR
- WLCG perfSONAR dashboard
 - Jisc perfSONAR dashboard
 - WLCG/OSG perfSONAR Check_MK
 - ESnet International perfSONAR dashboard
 
Accounting
- WLCG Accounting FAQ
 - New EGI accounting portal
 - Experiments APEL comparison UK
 - ATLAS APEL comparison UK last 3 months
 - CMS APEL comparison UK last 3 months
 
Experiment Monitoring
ATLAS
- ADC Monitoring
 - Blacklist specific pages
 
Job Monitoring
- Historical
- Panglia
 - Atlas Historical Dashboard UK view (Soon obsolete)
 - New grafana monitoring UK view
 
 - Big Panda Production
 - Big Panda Analysis
 - Pilot/Harvester worker
 
AGIS
DDM and transfers
Storage
- ATLAS UK storage accounting
 - Peter Love's pledge monitoring
 - Rucio Space Tokens plots
 - List of suspicious files
 
SUM and Lloyds
Other services
UK Support Mailing Lists
- atlas-support-cloud-uk@NOSPAMcern.ch Atlas UK Cloud Support (use for help solving problems and in GGUS tickets)
 - atlas-uk-comp-operations@NOSPAMcern.ch Atlas UK Computing Operations (use for general discussion)
 - Atlas UK meeting (Thursday 10am)
 
CMS
- Dashboard
 - CMS ETF (nagios)
 - CMS ETF (dashboard) T2
 - CMS ETF (dashboard) T3
 - Site status Board (T2 and T3). Click on 'Analysis' and 'Production' for further details.
 
- Site Readiness status (T2s only; the link opens at Brunel, scroll down for other UK sites)
 - PhEDEx data transfer rate plots
 - Debug transfers from UK sites: T2_UK_London_IC, T2_UK_London_Brunel, T2_UK_SGrid_RALPP, T2_UK_SGrid_Bristol, T3_UK_London_QMUL, T3_UK_London_RHUL, T3_UK_ScotGrid_GLA (no debug transfers for Oxford)
 
LHCb
- LHCb ETF (nagios)
 - LHCb ETF (dashboard)
 - Site Summary
 - LHCb Accounting: choose what you want to look at in the first drop-down box, called "category". Hints: "Data operation" = Transfer operations, "Job" = Completed jobs, "WMS history" = Pilot jobs, "Pilot" = Pilot statuses.
 - Steve Lloyd User Monitoring (lhcb)
 
ALICE
- ALICE ETF (nagios)
 - ALICE ETF (dashboard)
 - SE tests
 - Site Overview
 - Active jobs per site
 - CPU Accounting
 - RAW data replication speed
 
Tickets
Status tracking links
Links for tracking the deployment status at sites.
- SL6
 -  WebDAV
- https://www.gridpp.ac.uk/wiki/WebDAV WebDAV/xrootd
 
 - IPv6
 - perfSONAR
 - Backup VOMS servers
 - Argus
 
This page is a Key Document, and is the responsibility of Alessandra Forti. It was last reviewed on 2015-11-15 when it was considered to be 90% complete. It was last judged to be accurate on 2015-11-15.