Break scientific isolation of Pakistani physicists and bring them to the norms of international productivity in science. . .

DG Message

Site Map

 

Computing Projects


Several Projects catering the needs of different departments at NCP have been completed and a few are in pipeline and near completion. These projects have enhanced the working efficiency of various departments of NCP by automization and eliminating the traditional manual working. Following is the list of projects.

Projects Completed in the Past
The IT group of NCP has been involved in the development of the following successfully completed projects

WLCG Firewall

A firewall protects your computer or a local area network (LAN) from external intrusions. It may be a hardware device or a software program running on a secure host computer. A firewall examines all traffic routed between the two networks to see if it meets certain criteria. If it does, it is routed between the networks, otherwise it is stopped. Firewalls can filter packets based on their source and destination addresses and port numbers. This is known as address filtering.

In order to enhance the protection of the Grid from external attacks, a firewall service for the Grid components is desired. Such a firewall would need to be distributed amongst the different node types, and its rules would need to be built and updated dynamically in order to:

  • Include the different (sub)networks that may be used by a site
  • Dynamically integrate and exclude Grid sites
  • Insure that the rules have not been tempered with

NCP grid node is operational since June 2004 with constant upgrades to middleware software and hardware resources. Today we are running the latest middleware on the NCP-LCG2 grid node. In May 2007, the deployment team started to work on putting the grid node behind a firewall. By end of June 2007, the firewall service has been deployed and tested thoroughly. Final implementation was done in the first week of July, 2007.

At the server side a database keeps the records of genuine grid machines along with their IP addresses and roles in the grid node. A web application checks the clients connecting to the server and authorizes them if their IP addresses exist in the database. Then based on the role of each client the server publishes the firewall rules in Linux iptables format files which client adopts.

At the client side, a small RPM is installed with a single script that runs as a scheduled process (cron job). This script pulls the published firewall rule file from the server and verifies that the new configuration allows it to have network access. It then installs this new rule file on the host. It also provides a rollback mechanism when the new configuration fails.

Using this mechanism firewall rules for the whole grid site can be updated/modified in no time. New nodes and roles (node types) can be added. It also provides ease of management with a central management server.

Team: Usman A. Malik, Sajjad Asghar, Adeel ur Rehman, Sobia Aqeel

Network Information System for NCP

It describes the Process of Network and Computing Resource Management in NCP. The system is a web based application offering remote manipulation by the end-users. The main purpose of this application is:

  • Manage hardware components and specifications.
  • Record complete details of software installed in a particular hardware.
  • Record complete details of hardware related to user.
  • Record complete details of location with respect to office place and network service where hardware is placed.
  • Record operations performed on hardware and maintain history.
Three types of  users can interact with the system:
  • Administrator
  • Data Entry Operator
  • Normal User

Users can login to the system using their windows account information. Specific rights will be assigned to them according to their roles.

Administrator can perform the following operation:
  • Manage Role
  • Manage Privileges
  • Manage Hardware
  • Manage Location
  • Manage Manufacturer
  • Manage H/W Models
  • Manage Domain
  • Manage Responsible Group
  • Manage Vendor
  • Manage Service
  • Manage Interfaces
  • Search Hardware
  • Sign Request
Following privileges are assigned to DEO:
  • Manage Hardware
  • Manage Location
  • Manage Manufacturer
  • Manage H/W Models
  • Manage Domain
  • Manage Responsible Group
  • Manage Vendor
  • Manage Service
  • Search Hardware
  • Manage Interfaces
User can:
  • Search for available device
  • Request new connection for a device
  • Request to modify a registered device
  • Request to move and reconnect a device
  • Request to remove a device from the network
Other automatic operations such as history management, Reminders, Email notification are also implemented in this system.
Team: Sobia Aqeel
PBSMON (Portable Batch System MONitor)

PBS stands for the Portable Batch System. It is a batch system that is based on a server and client model. You define the master node as the batch master (server) and the rest of the machines works as its clients. There are other commercial batch systems available like LSF and CONDOR. At NCP we use OpenPBS, a free and open source version of the PBS. This free version is scalable and proficient for up to 800 nodes, which is more than our current or future needs. The Computing Element (CE) in NCP-LCG2 grid node is acting as the PBS master/server node, while the Worker Nodes (WN) are acting as slave nodes.

Initially, the grid node consisted of one server and fourteen (14) clients only, but as the number of clients grow (which has now reached 80), managing or troubleshooting errors in all clients one by one is very tiring and time consuming. To provide efficient management and troubleshooting of PBS server(s)/ and clients a PBS Monitor system has been developed. The basic purpose of this system is to provide all the logs from PBS servers or clients at a single place which makes troubleshooting very easy. PBS log files from all the server and client machines are parsed every ten minutes using a scheduled (cron) job and the new entries (entries generated after the last parsing) are uploaded to a central server. This central server is running a web page which displays all PBS nodes. Another parser is scheduled to run every ten minutes on the server, which looks for known error codes and problems in the log files. If it finds any problem or error codes in the log files the color of corresponding PBS node on the web page is changed to orange (in case of warning message) or red (in case of critical errors). So one can find out a possible problem just by looking at the web page rather than digging it by logging onto several nodes one at each time. Email alerts can also be generated in case of critical errors. This would enable to monitor the system efficiently even if the number of nodes grows to a very large scale.

The server also compresses the big log files to save space.
Team: Usman A. Malik, Adeel ur Rehman, Sobia Aqeel

 
 

Copyright © NCP, 2003 - .  All Rights Reserved.