Showing 17 results for Vali
Dr. Vali Derhami, Mr. Omid Mehrabi,
Volume 5, Issue 1 (Journal of Control, V.5, N.1 Spring 2011)
Abstract
One of the challenges encountered in the application of classical reinforcement learning methods to real-control problems is the curse of dimensiality. In order to overcome this difficulty, hybrid algorithms that combine reinforcement learning with various function approximators have attracted many research interests. In this paper, a novel Neural Reinforcement Learning (NRL) scheme which is based on Sarsa learning and Radial Basis Function (RBF) network is proposed. The RBF network is used to approximate the Action Value Function (AVF) on-line. The inputs of RBF network are state-action pairs of system and its outputs are corresponding approximated AVF. As the necessary condition for the convergence of NSL to the optimal task performance, the existence of stationary points for NSL which coincide with the fixed points of Approximate Action Value Iteration (AAVI) are proved. The validity of the proposed algorithm is tested through simulation examples: mountain car control task, and acrobot problem. Overall results demonstrate that our algorithm can effectively improve convergence speed and the efficiency of experience exploitation.
Dr. Vahid Behnam Gol, Dr. Iman Mohammad Zaman, Dr. Ahmadreza Vali, Dr. Nemat Allah Ghahramani,
Volume 5, Issue 3 (Journal of Control, V.5, N.3 Fall 2011)
Abstract
In this paper, a two point guidance law for homing interceptors using finite time second order sliding mode control and based on parallel navigation is proposed. In the proposed guidance law, sliding surface is selected as the line of sight rate and the target maneuvers are considered as an uncertainty which only needs the upper bound of these maneuvers. Furthermore, the proposed algorithm can guarantee the finite time convergence of the LOS rate to zero or a small neighborhood of zero. Therefore, the performance and stability of guidance loop against maneuvering targets are increased.
Mrs. Fateme Fathinezhad, Dr. Vali Derhami,
Volume 6, Issue 3 (Journal of Control, V.6, N.3 Fall 2012)
Abstract
Applying supervised learning in robot navigation encounters serious challenges such as inconsistence and noisy data, difficulty to gathering training data, and high error in training data. Reinforcement Learning (RL) capabilities such as lack of need to training data, training using only a scalar evaluation of efficiency and high degree of exploration have encourage researcher to use it in robot navigation problem. However, RL algorithms are time consuming also have high failure rate in the training phase. Here, a novel idea for utilizing advantages of both above supervised and reinforcement learning algorithms is proposed. A zero order Takagi-Sugeno (T-S) fuzzy controller with some candidate actions for each rule is considered as robot controller. The aim of training is to find appropriate action for each rule. This structure is compatible with Fuzzy Sarsa Learning (FSL) which is used as a continuous RL algorithm. In the first step, the robot is moved in the environment by a supervisor and the training data is gathered. As a hard tuning, the training data is used for initializing the value of each candidate action in the fuzzy rules. Afterwards, FSL fine-tunes the parameters of conclusion parts of the fuzzy controller online. The simulation results in KiKS simulator show that the proposed approach significantly improves the learning time, the number of failures, and the quality of the robot motion.
Eng Farzaneh Ghorbani, Dr Vali Derhami, Dr Hossein Nezamabadi Pour,
Volume 8, Issue 1 (Journal of Control, V.8, N.1 Spring 2014)
Abstract
In this paper, we present a novel continuous reinforcement learning approach. The proposed approach, called "Fuzzy Least Squares Policy Iteration (FLSPI)", is obtained from combination of "Least Squares Policy Iteration (LSPI)" and a zero order Takagi Sugeno fuzzy system. We define state-action basis function based on fuzzy system so that LSPI conditions are satisfied. It is proven that there is an error bound for difference of the exact state-action value function and approximated state-action value function obtained by FLSPI. Simulation results show that learning speed and operation quality for FLSPI are higher than two previous critic-only fuzzy reinforcement learning approaches i.e. fuzzy Q-learning and fuzzy Sarsa learning. Another advantage of this approach is needlessness to learning rate determination.
Eng. Farzane Nadi, Dr. Vali Derhami, Dr. Mehdi Rezaeian,
Volume 8, Issue 2 (Journal of Control, V.8, N.2 Summer 2014)
Abstract
Visual servoing system is a system to control a robot by visual feedback so that robot drives from any arbitrary start position to the target positions. Various ways, including control by using model of the robot, designing controller directly, and using Jacobian matrix have been studied. Since there is not access to model of robot and obtaining a model of robot would be difficult and time consuming, in many cases, the control law is obtained using Jacobian matrix. In this paper, inverse of Jacobian matrix is approximated using artificial neural networks. The approximated neural models are used in control law directly. For each degree of freedom of the robot manipulator, a two-layer feedforward neural network is considered. The distance between end-effector and target along the x-axis and y-axis, and the shoulder joint coordinates along the x-axis and y-axis are the inputs of each of the networks and the outputs are the fraction of the related robot joint changes to the image features changes (the elements of the inverse of Jacobian matrix). The proposed method has been implemented on a real robot manipulator. The experimental results show that the proposed control system can move the end-effector to different target positions in workspace with good accuracy.
Mr Vahid Behnamgol, Dr Ahmadreza Vali, Dr Ali Mohammadi,
Volume 8, Issue 2 (Journal of Control, V.8, N.2 Summer 2014)
Abstract
In this paper, a nonlinear and robust guidance system against target maneuvers has been designed. For this purpose, first a new high order sliding mode algorithm is proposed. The designed guidance law with this algorithm generates a smooth acceleration command that guarantees collision with target. In this algorithm, unlike previous high order sliding mode theories, the stability of close loop system in the presence of uncertainty is guaranteed, therefore the observer is not required for estimation of target maneuvers in the proposed guidance law. For designing two point guidance law using this algorithm, a sliding variable has been introduced using relative lateral velocity. Designed guidance law generates acceleration commands that guarantee convergence of sliding variable. Simulation results show the better performance of proposed guidance law in comparison with other guidance laws.
Farinaz Alamiyan Harandi, Vali Derhami,
Volume 11, Issue 4 (Journal of Control, V.11, N.4 Winter 2018)
Abstract
This paper proposes a framework of Supervised Deep Learning (SDL) for wheeled robot navigation in soft terrains with a focus on wall following and obstacle avoidance tasks. Here, it is supposed the robot is only equipped with a vision system (Kinect camera). The main challenge while using depth images is high dimensionality of images and extracting proper features of them with a purpose of reducing input dimensionality of controller. To do this, the deep learning is utilized in this paper and the appropriate features which are the representation of depth images are acquired. Four architectures are created using this features and the history of steering commands. These architectures are compared in WEBOT simulator. The experiments show that the proposed architecture with four groups of features including: the represented features of depth data, previous represented features, the position of trajectory in color image, and the history of previous steering commands can control the robot in soft terrain with a variety of obstacles as well.
Fatemeh Abadianzadeh, Vali Derhami, Mehdi Rezaeain,
Volume 12, Issue 1 (Journal of Control, V.12, N.1 Spring 2018)
Abstract
Vision-based robot control is a method to motion control of a robot using information extracted from visual sensors. In traditional approaches, a model of robot and camera are needed. Obtaining these models are time consuming and sometimes impossible. Recently, intelligent methods are used to cope the above challenges. In this paper, a hybrid fuzzy controller is proposed to control a robot manipulator. Visual inputs of the controller are provided by Kinect and outputs are the rotation of joints motors. The hybrid controller contains two controllers. The first controller in based on fuzzy inverse model which approximates real inverse model of robot using gathered data. In order to increase accuracy, a fuzzy expert controller is designed and it is used when the end-effector is in the predefined near-goal area. Since determining exact value of the fuzzy expert controller parameters is impossible, in addition to make system adaptive with small changes in the environment, actor-critic architecture is used. This architecture is a well known continuous reinforcement learning methods. The proposed method is applied to control a real robot manipulator (ARM_6AX18). Experimental results show that using the proposed method in practice, the end-effector reaches from any random start position to the goal position with a good accuracy in robot workspace.
Mojtaba Hadi Barhaghtalab, Vahid Meigoli, Valiollah Ghaffari,
Volume 13, Issue 3 (Journal of Control, V.13, N.3 Fall 2019)
Abstract
In this paper, an ANFIS+PID hybrid control policy has been addressed to control a 6-degree-of freedom (6-DOF) robotic manipulator. Then its error convergence has been also evaluated. The ability to formulate and estimate the system uncertainties and disturbances along with system dynamics and rejecting the disturbances effect are some advantages of the proposed method in comparing with the conventional ANFIS structures. The error convergence could not be proved in the ordinary ANFIS structures. But in the proposed method, the error convergence of the robot manipulator can be established under considering some mathematical conditions. The proposed control law is realized via parallel combination of ordinary ANFIS network and PID controller. The suggested method has been successfully applied in a 6-DOF robot manipulator system. Furthermore, in presence of uncertainties and external disturbances error convergence would be justified using the Lyapunov-like theorem and Barbalat lemma.
Mohammad Javad Rajabi, Ahmad Reza Vali, Vahid Behnamgol,
Volume 14, Issue 1 (Journal of Control, V.14, N.1 Spring 2020)
Abstract
The design of integrated guidance and control system for flying objects is one of the research fields in the aerospace that is considered by researchers in recent years. Due to the nonlinearity of the kinematic and dynamic equations of the homing interceptors in the terminal phase and also existence of uncertainties such as target maneuvers, external disturbances and variations in aerodynamic coefficients, siding mode control theory is a suitable method for designing integrated guidance and control system. One of the most problem of sliding mode is the presence of high frequency oscillations in the control signal that is called chattering, which makes it impossible to implement this controller. A method for smoothing the control signal in a sliding mode is to use a disturbance observer. In this paper, the control signal is completely smooth, with having the disturbances estimation and using a different scheme compared with standard sliding mode. Also, the finite time convergence is guaranteed in the presence of uncertainty. The proposed guidance and control system are evaluated in computer simulation.
Sanaz Sabzevari, Ahmad Reza Vali, Mohammad Reza Arvan, Seyyed Mohammadmehdi Dehghan, Mohammad Hossein Ferdowsi,
Volume 14, Issue 4 (Journal of Control, V.14, N.4 Winter 2021)
Abstract
Designing the estimator that can determine the attitude with a single sensor is vital due to the limited weight and volume in the nano-satellite, the problems caused by the limited lifetime of the mechanical gyroscope in the long term and the eclipse phenomenon. To compensate for data deficiency, a two-nested filter has been utilized in this paper. To this end, the attitude in the second filter is estimated using the sensor data and the magnetic field derivative estimation from the first filter by the extended Kalman filter. Two stochastic algorithms named as multiplicative extended Kalman filter and square-root unscented quaternion estimator are compared with the proposed symmetry-preserving nonlinear observer in order to obtain an appropriate accuracy for determining the attitude of the nano-satellite, which has only a three-axis magnetometer. The proposed method is based on invariant observers under the action of the Lie group. The moving frame approach has been used so that the observer's parameters can be adjusted through the invariant error dynamic equations. Simulation results confirm an acceptable accuracy in all three algorithms for both time and frequency response analyses. However, the root mean square error of the attitude error with a nonlinear observer is much less than the stochastic algorithm in case of a larger initial estimation error. Furthermore, this approach guarantees convergence by the Lyapunov stability proof owing to setting the parameters with periodic differential Riccati equations.
Ayoub Valipoori, Gholamreza Latif Shabgahi,
Volume 15, Issue 1 (Journal of Control, V.15, N.1 Spring 2021)
Abstract
Alarm systems play an important role in ensuring safety, and preventing event occurrence in industrial plants. One of the most important steps in alarm system designing is estimation of the proper probability density function (pdf). Conventional methods in alarm system designing like, dead-band and delay timers cannot be more effective in case of alarm variable with mixture pdf. This paper presents a new method to design an univariate alarm system with mixture pdf in alarm variables. In this paper three alarm performance indecis are derived for variables with gussian pdf. In proposed method, rasing and clearing alarms are based on the probability values corresponding to the instantaneous alarm variable values in the normal and abnormal pdfs (normal and abnormal reference models). The effectiveness of the proposed method is shown during some simulation and industrial case studies and its performance compared with Reset scenario in delay timers. In one of the case studies, the performance of the proposed method in the DAMADICS benchmark actuators has been investigated.
Reza Amjadifard, Mohammad Tavakoli Bina, Hamid Khaloozadeh, Farhad Bagheroskouei, Vali Talebzadeh,
Volume 15, Issue 2 (Journal of Control, V.15, N.2 Summer 2021)
Abstract
Resonant converter due to implementation of zero voltage switching (ZVS) or zero current switching (ZCS), are very interested. Although using these techniques, increases the efficiency and also decreases the generated EMI noise, obtaining the small-signal model of these converters is very complicated. The state-space variables mostly change as a sinusoid curve, so the average of these variables is equal to zero. Therefore the traditional method such as state-space averaging is not applicable in order to obtaining the state space equation of the converter. In this article, the state space equation is obtained by using the extended describing functions. To verify the obtained equation, the Middlebrook method is suggested. By means of this method, the bode-plot of the open-loop transfer function could be obtained based on the existing hardware. So an isolated series resonant converter is implemented and the required signals are measured in order to draw the bode-plot of the open-loop transfer function based on Middlebrook method. Verification is performed by comparing the experimental results with simulation results.
Mahsa Javaheripour, Ahmadreza Vali, Vahid Behnam Gol, Firouz Allahverdi Zadeh,
Volume 16, Issue 2 (Journal of Control, V.16, N.2 Summer 2022)
Abstract
The line of sight (LOS) rate is a parameter that is needed to calculate the acceleration applied to missiles by the proportional guidance laws in order to hit the target. This rate is usually measured using gimbaled seekers. However, if the type of missile seeker be strap down, the LOS rate must be calculated from deriving the missile's seeker output angles or estimation methodes. The derivation method is not desirable due to the noisy output of the seekers and low pass filters are needed to achieve an acceptable output, which will cause a lag in the guidance loop. In this paper, a discrete time extended state observers will be designed to estimate the LOS rate. The advantage of the time discontinuity of the observer is that issues related to the implementation of the observer on the processors, such as the choice of sampling time, considered from the design level and examined in computer simulation.
Mostafa Darabi Moghadam, Ahmadreza Vali, Seyed Mahdi Hakimi, Vahid Behnamgol, Ghasem Derakhshan,
Volume 16, Issue 3 (Journal of Control, V.16, N.3 Fall 2022)
Abstract
The two degrees of freedom servomechanism has many applications, including in gimbaled seekers. These mechanisms require closed-loop control to perform properly. In this paper, an observer-based multi-input-multi-output hybrid controller is designed for a two-degree-of-freedom servomechanism. Since in the model presented in this paper, disturbances on the mechanism are considered, so an extended state observer to estimate disturbance term to improve the controller performance. Also, due to the nonlinearity and two input- two output dynamics of these mechanisms, the use of combined nonlinear multivariate control methods to control the angle in these mechanisms will increase efficiency. For this purpose, nonlinear auxiliary control inputs are first determined in the first step. Then, in the second step, the nonlinear control input vector is determined using the multi-input-multi-output linear feedback method. In this step, a discrete time observer is used to estimate the uncertainty. The simulation results show that the proposed observer accurately estimates the disturbance and provides it to the controller. The controller designed using this information is able to control the output angles. Also, the results of the controller implementation designed in the processor in the loop test are presented.
Farzaneh Nadi, Vali Derhami, Farinaz Alamiyan Harandi,
Volume 18, Issue 2 (Journal of Control, V.18, N.2 Summer 2024)
Abstract
This paper presents a new method for using data collected from the agent's random movement in the environment for the initial adjustment of parameters of a controller with a fuzzy reinforcement learning structure. Slow learning speed and high failure rates during training are two major challenges in such structures. The initial parameterization of the fuzzy system can be a suitable solution to address these challenges. In this paper, the method of discrete value iteration is extended to continuous without relying on derivative based methods to initialize the parameters of the fuzzy system. First, random interaction with the environment is used to collect relevant data. Since the state space is continuous, the data is appropriately clustered and each cluster is considered as a state. Then, by generalizing the standard value iteration method to the continuous, the transition probability matrix and the immediate reward expectation matrix are calculated. Using the results of this stage, the initial parameterization of the fuzzy reinforcement learning structure is performed. Subsequently, these parameters are fine-tuned using reinforcement learning. The proposed method is called "Value Iteration based Fuzzy Reinforcement Learning" and is used in the problem of target following robots. The experimental results indicate a significant improvement in the performance of the proposed method in the problem of target following robots.
Morteza Firouzabadii, Ahmad Reza Vali, Abdorreza Kashaninia,
Volume 18, Issue 3 (Journal of Control, V.18, N.3 Fall 2024)
Abstract
The event-triggered approach is a scheduling strategy designed to reduce the frequency of control signal updates over time, aiming to enhance overall system efficiency concerning energy consumption and network bandwidth, while maintaining the stability and performance of the control system. This paper investigates the problem of designing an event-based controller for linear time invariant (LTI) systems in the presence of mismatched external disturbances. It is assumed that certain state variables can be directly measured as outputs. The design of an event trigger sliding mode observer is investigated by incorporating a Luenberger observer to meet the H_∞ performance requirements. A sliding mode controller is designed using the Lyapunov method to ensure that the system trajectories remain within the predefined sliding surface. Furthermore, the tuning parameters of the sliding surface and the state observer are calculated using linear matrix inequality (LMI) to achieve asymptotic stability of the sliding mode dynamics with a prescribed H_∞ index. Finally, a non-zero lower bound for the minimum inter-event interval is determined to prevent Zeno phenomena. Then, two numerical examples are provided to demonstrate the performance of the developed controller. Simulation results indicate that, besides achieving the objective of stabilizing the closed-loop system, the control commands generated by the controller are significantly reduced.