EAI Endorsed Transactions on AI and Robotics https://publications.eai.eu/index.php/airo <p>EAI Endorsed Transactions on AI and Robotics (eISSN: 2790-7511) covers all aspects of robotics and knowledge-based AI systems along with interdisciplinary approaches to computer science, control systems, computer vision, machine learning, electrical engineering, intelligent machines, mathematics, and other disciplines. An important goal of this journal is to extend cutting-edge technologies in the control and learning of both symbolic and sensory robots with regard to smart systems. Our journal contains articles on the theoretical, mathematical, computational, and experimental aspects of robotics and intelligent systems.</p> <p><strong>INDEXING</strong>: CrossRef, Google Scholar, ProQuest, EBSCO, CNKI, Dimensions</p> EAI en-US EAI Endorsed Transactions on AI and Robotics 2790-7511 <p>This is an open access article distributed under the terms of the <a href="https://creativecommons.org/licenses/by-nc-sa/4.0/">CC BY-NC-SA 4.0</a>, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.</p> Designing Automation for Pickup and Delivery Tasks in Modern Warehouses Using Multi Agent Path Finding (MAPF) and Multi Agent Reinforcement Learning (MARL) Based Approaches https://publications.eai.eu/index.php/airo/article/view/3449 <p>A warehouse pickup and delivery problem finds its solution using multi agent path finding (MAPF) approach. Also, the problem has been used to showcase the capabilities of the multi agent reinforcement learning (MARL). The warehouse pickup and delivery work needs the agent to pick up a requested item and successfully deliver it to the intended location within the warehouse. The problem has been solved based on two approaches that include single shot and lifelong problem solution. The single shot solution has the delivery as the final goal and thus once it reaches the delivery address, it stops whereas in case of lifelong, the agent needs to deliver the item which it had picked, deliver it to the required place and then again pick up new item until requests are satisfied. The strategy used by multi agent path finding (MAPF) approach aims at constructing collision free paths to reach the delivery location but in case of multi agent reinforcement learning (MARL), the agents’ decision making tactics (or policies) are learned which are then used to help agents decide path to be followed based on environment state and agent’s position. The results show that the lifelong conflict based search (CBS) is a better option when the agents are less in number as in that case, the re-planning will take overall less time but when the agents are large in number then this re-planning can take very long to produce conflict free paths from source to goal nodes. In this case, shared experience action critic (SEAC) which is based on multi agent reinforcement learning (MARL) approach can be more efficient choice as it takes the current environment state to give the most suitable action for that time t. For this study the agents taken for learning are homogeneous in nature that can pickup and deliver any type of requested item. We can address the same pickup and delivery problem when the agents are not all same and differ in their capabilities and the type of item they can handle.</p> Shambhavi Mishra Rajendra Kumar Dwivedi Copyright (c) 2023 Shambhavi Mishra, Rajendra Kumar Dwivedi https://creativecommons.org/licenses/by-nc-sa/4.0 2024-03-18 2024-03-18 3 10.4108/airo.3449 Implementation of GPT models for Text Generation in Healthcare Domain https://publications.eai.eu/index.php/airo/article/view/4082 <p>INTRODUCTION: This paper highlights the potential of using generalized language models to extract structured texts from natural language descriptions of workflows in various industries like healthcare domain</p><p>OBJECTIVES: Despite the criticality of these workflows to the business, they are often not fully automated or formally specified. Instead, employees may rely on natural language documents to describe the procedures. Text generation methods offer a way to extract structured plans from these natural language documents, which can then be used by an automated system.</p><p>METHODS: This paper explores the effectiveness of using generalized language models, such as GPT-2, to perform text generation directly from these texts</p><p>RESULTS: These models have already shown success in multiple text generation tasks, and the paper's initial results suggest that they could also be effective in text generation in healthcare domain. In fact, the paper demonstrates that GPT-2 can generate comparable results to many current text generation methods.</p><p>CONCLUSION: This suggests that generalized language models can increase the efficiency and accuracy in text generation, where workflows are repetitive and sequential.</p> Anirban Karak Kaustuv Kunal Narayana Darapaneni Anwesh Reddy Paduri Copyright (c) 2024 Anirban Karak, Kaustuv Kunal, Narayana Darapaneni, Anwesh Reddy Paduri https://creativecommons.org/licenses/by-nc-sa/4.0 2024-04-09 2024-04-09 3 10.4108/airo.4082 A Comparison of the Performance of Six Machine Learning Algorithms for Fake News https://publications.eai.eu/index.php/airo/article/view/4153 <p><strong>INTRODUCTION</strong>: This research focuses on the increasing importance of social media websites as versatile platforms for entertainment, work, communication, commerce, and accessing global news. However, it emphasizes the need to use this power responsibly.</p><p><strong>OBJECTIVES</strong>: The objective of the study is to evaluate the performance of artificial intelligence algorithms in detecting fake news.</p><p><strong>METHODS</strong>: Through a comparison of six machine learning algorithms and the use of natural language processing techniques,</p><p><strong>RESULTS</strong>: The study identifies four algorithms with a 99% accuracy rate in detecting fake news.</p><p><strong>CONCLUSION</strong>: The results demonstrate the effectiveness of the proposed method in enhancing the performance of artificial intelligence algorithms in addressing the problem of fake news detection.</p> Rafah H Al-Furaiji Hasan Abdulkader Copyright (c) 2024 Rafah H Al-Furaiji, Hasan Abdulkader https://creativecommons.org/licenses/by-nc-sa/4.0 2024-03-20 2024-03-20 3 10.4108/airo.4153 Improved Hybrid Preprocessing Technique for Effective Segmentation of Wheat Canopies in Chlorophyll Fluorescence Images https://publications.eai.eu/index.php/airo/article/view/4621 <p>Precision agriculture heavily relies on accurately segmenting wheat canopies from chlorophyll fluorescence (CHF) images. However, these images often face challenges due to inherent noise and illumination variations, primarily induced by the thermal activity of photons emitting a fluorescence effect. The unique nature of fluorescence introduces variations in illumination, especially during the crop's dark adaptation before experimentation. This adaptation aims to capture the full fluorescence effect, starting from minimum fluorescence and progressing to maximum fluorescence.&nbsp;In the initial stages of fluorescence, images tend to appear darker compared to those progressing towards maximum fluorescence. This variability necessitates the development of a sophisticated hybrid approach to eliminate noise and enhance contrast collaboratively, maximizing the benefits derived from CHF images. This paper introduces a novel hybrid preprocessing approach designed to address these challenges.&nbsp;The proposed method integrates five denoising techniques, namely Discrete Cosine Transform, Block Matching-3D, Low-Rank Matrix Approximation, Wiener Filtering, and Median Filtering, to mitigate the impact of noise in CHF images. Simultaneously, two enhancement techniques, Adaptive Histogram Specification and Gamma Correction, are employed to accentuate critical features, compensating for inherent variations in illumination during the fluorescence process.&nbsp;The hybrid preprocessing technique was proposed after analysing different combinations of denoising and enhancement techniques. Through qualitative and quantitative analysis of the results, it was observed that Block Matching-3D&nbsp;denoising&nbsp;with Gamma Correction produced the best output, with an Average PSNR of 0.54 and Average MSE of 0.07. This cascaded approach not only emphasizes noise reduction but also prioritizes the enhancement of crucial information within CHF images.&nbsp;By synergistically combining denoising and enhancement methods, the proposed approach optimizes the overall quality of the images, laying a foundation for improved wheat canopy segmentation. This research contributes a comprehensive and innovative solution to the challenges associated with CHF images in precision agriculture. The proposed hybrid approach holds promise for advancing the accuracy and reliability of wheat canopy segmentation, thereby enhancing the efficacy of precision agricultural practices.</p><p>&nbsp;</p> Ankita Gupta Copyright (c) 2024 Ankita Gupta https://creativecommons.org/licenses/by-nc-sa/4.0 2024-01-08 2024-01-08 3 10.4108/airo.4621 Utilizing Fundamental Analysis to Predict Stock Prices https://publications.eai.eu/index.php/airo/article/view/5140 <p>Portfolio management involves the critical task of determining the optimal times to enter or exit a stock in order to maximize profits in the stock market. Unfortunately, many retail investors struggle with this task due to unclear investment objectives and a lack of a structured decision-making process. With the vast number of stocks available in the market, it can be difficult for investors to determine which stocks to invest in. As a result, there is a growing need for the development of effective investment decision support systems to assist investors in making informed decisions. Researchers have explored various approaches to building such systems, including predicting stock prices using sentiment analysis of news, articles, and social media, as well as historical trends and patterns. However, the impact of financial reports filed by companies on stock prices has not been extensively studied. This paper aims to address this gap by using machine learning techniques to develop a more accurate stock prediction model based on financial reports from companies in the Nifty 50. The financial reports considered include quarterly reports, annual reports, cash flow statements, and ratios.</p> Akshay Khanpuri Narayana Darapaneni Anwesh Reddy Paduri Copyright (c) 2024 Akshay Khanpuri, Narayana Darapaneni, Anwesh Reddy Paduri https://creativecommons.org/licenses/by-nc-sa/4.0 2024-03-22 2024-03-22 3 10.4108/airo.5140 Improving recognition accuracy for facial expressions using scattering wavelet https://publications.eai.eu/index.php/airo/article/view/5145 <p>One of the most evident and meaningful feedback about people’s emotions is through facial expressions. Facial expression recognition is helpful in social networks, marketing, and intelligent education systems. The use of Deep Learning based methods in facial expression identification is widespread, but challenges such as computational complexity and low recognition rate plague these methods. Scatter Wavelet is a type of Deep Learning that extracts features from Gabor filters in a structure similar to convolutional neural networks. This paper presents a new facial expression recognition method based on wavelet scattering that identifies six states: anger, disgust, fear, happiness, sadness, and surprise. The proposed method is simulated using the JAFFE and CK+ databases. The recognition rate of the proposed method is 99.7%, which indicates the superiority of the proposed method in recognizing facial expressions.</p> Mehdi Davari Aryan Harooni Afrooz Nasr Kimia Savoji Masoumeh Soleimani Copyright (c) 2024 Mehdi Davari, Aryan Harooni, Afrooz Nasr, Kimia Savoji, Masoumeh Soleimani https://creativecommons.org/licenses/by-nc-sa/4.0 2024-03-13 2024-03-13 3 10.4108/airo.5145 Fog Cloud Computing and IoT Integration for AI enabled Autonomous Systems in Robotics https://publications.eai.eu/index.php/airo/article/view/3617 <p class="ICST-abstracttext"><span lang="EN-GB">Fog Cloud Computing and the Internet of Things are transforming robotics by empowering AI-enabled autonomous systems. This study analyzes the benefits, drawbacks, and uses of this integration. AI-enabled autonomous robots can use edge computing and cloud resources for real-time data processing and decision-making, improving their performance and adaptability. Communication protocols, data management, security, and scalability are examined in the ecosystem. Case studies reveal how this confluence affects robotics applications. This research shows how FCC, IoT, and AI may improve robotic systems' efficiency, intelligence, and autonomy. The article covers AI-enabled autonomous systems in transportation, manufacturing, healthcare, agriculture, and smart cities. These technologies can improve productivity and safety in many fields, from self-driving automobiles to surgical robots. Integrating these technologies raises safety, ethical decision-making, data privacy, and security concerns. The report emphasizes transparent and ethical AI algorithms, unbiased decision-making, and regulatory frameworks to enable responsible integration and mitigate dangers. In the future, AI-enabled autonomous systems will be shaped by improved AI algorithms, multi-modal sensing, human-robot collaboration, and edge intelligence. It emphasizes the necessity of interdisciplinary collaboration and ethical considerations in responsible technology development. This study concludes with a detailed analysis of fog/cloud computing, IoT, and AI in robotics, revealing the immense promise and problems of AI-enabled autonomous systems. Responsible development and collaboration can help us negotiate this transformational frontier and create a safer, more efficient, and innovative society with AI-driven autonomous systems.</span></p><p class="ICST-abstracttext"><span lang="EN-GB">&nbsp;</span></p> Kiran Deep Singh Prabhdeep Singh Copyright (c) 2023 Prabhdeep Singh, Kiran Deep Singh https://creativecommons.org/licenses/by-nc-sa/4.0 2024-03-12 2024-03-12 3 10.4108/airo.3617 Interdisciplinary Approaches: Fog/Cloud Computing and IoT for AI and Robotics Integration https://publications.eai.eu/index.php/airo/article/view/3620 <p>Fog/Cloud Computing and the Internet of Things have created intriguing opportunities for AI and robotics integration. This study examines interdisciplinary approaches that combine FCC, IoT, AI, and Robotics to construct sophisticated autonomous systems. These integrated systems may efficiently and intelligently conduct complicated tasks by using edge devices and cloud resources. Communication protocols, data management, security, and interoperability are studied in this interdisciplinary environment. Real-world case studies demonstrate the practicality and benefits of this integration. This study shows how interdisciplinary approaches will change AI and robotics integration. In conclusion, the intersection of Fog/Cloud Computing, IoT, AI, and Robotics is influencing autonomous systems. Edge devices and the cloud enable robots to become intelligent, adaptable, and essential parts of many industries. This research encourages researchers, practitioners, and policymakers to collaborate on innovation and widespread adoption of disruptive technologies. Interdisciplinary techniques are essential to maximizing AI and robotics integration and launching a new era of intelligent automation</p> Prabh Deep Singh Kiran Deep Singh Copyright (c) 2023 Kiran Deep Singh, Prabh Deep Singh https://creativecommons.org/licenses/by-nc-sa/4.0 2024-01-08 2024-01-08 3 10.4108/airo.3620 Computer vision recognition in the teaching classroom: A Review https://publications.eai.eu/index.php/airo/article/view/4079 <p>Artificial intelligence introduces computer vision recognition into the teaching classroom, and computer vision recognition technology lays a solid foundation for the intelligent teaching classroom. Through the classroom camera video stream to the classroom student information data collection, voice, posture, facial, physiological signal data recognition analysis processing to extract and define the characteristics of student behaviour, automatic classification behaviour and then record and display student behaviour, thus effectively help teachers to grasp the students learning state and emotions, to promote the quality of teaching has far-reaching significance.</p> Hui Jiang Wentao Fu Copyright (c) 2024 Hui Jiang, Wentao Fu https://creativecommons.org/licenses/by-nc-sa/4.0 2024-01-08 2024-01-08 3 10.4108/airo.4079 Exploring the Capabilities of NeRF in Generating 3D Models https://publications.eai.eu/index.php/airo/article/view/5360 <p class="ICST-abstracttext"><span lang="EN-GB">This review paper presents a comprehensive analysis of three cutting-edge techniques in 3D content synthesis: EG3D, DreamFusion, and Magic3D. EG3D, leveraging geometry-aware representations and generative adversarial networks, enables the generation of high-quality 3D shapes. DreamFusion integrates text-to-image diffusion models with neural rendering, opening new horizons for creative expression. Magic3D, on the other hand, extends text-to-image synthesis principles to 3D content creation, synthesizing realistic and detailed models. We delve into the theoretical frameworks, neural network architectures, and loss functions of these techniques, analyzing their experimental results and discussing their strengths, weaknesses, and potential applications. This review serves as a valuable resource for researchers and practitioners, offering insights into the latest advancements and pointing towards future directions for exploration in 3D content synthesis.</span></p> Shun Fang Copyright (c) 2024 Shun Fang https://creativecommons.org/licenses/by-nc-sa/4.0 2024-04-22 2024-04-22 3 10.4108/airo.5360 A Survey of Data-Driven 2D Diffusion Models for Generating Images from Text https://publications.eai.eu/index.php/airo/article/view/5453 <p>This paper explores recent advances in generative modeling, focusing on DDPMs, HighLDM, and Imagen. DDPMs utilize denoising score matching and iterative refinement to reverse diffusion processes, enhancing likelihood estimation and lossless compression capabilities. HighLDM breaks new ground with high-res image synthesis by conditioning latent diffusion on efficient autoencoders, excelling in tasks through latent space denoising with cross-attention for adaptability to diverse conditions. Imagen combines transformer-based language models with HD diffusion for cutting-edge text-to-image generation. It uses pre-trained language encoders to generate highly realistic and semantically coherent images, surpassing competitors based on FID scores and human evaluations in DrawBench and similar benchmarks. The review critically examines each model's methods, contributions, performance, and limitations, providing a comprehensive comparison of their theoretical underpinnings and practical implications. The aim is to inform future generative modeling research across various applications.</p> Shun Fang Copyright (c) 2024 Shun Fang https://creativecommons.org/licenses/by-nc-sa/4.0 2024-04-22 2024-04-22 3 10.4108/airo.5453 Micro robot as the feature of robotic in healthcare approach from design to application: the State of art and challenges https://publications.eai.eu/index.php/airo/article/view/5602 <p>Micro robots, miniature robotic devices typically ranging from micrometers to a few millimeters in size, hold immense potential in various fields, particularly healthcare. Their diminutive stature enables access to intricate anatomical regions previously unreachable, facilitating targeted drug delivery, localized treatment, and precise monitoring. These robots offer numerous advantages, including enhanced maneuverability, reduced invasiveness, and minimized tissue damage. By navigating through complex biological environments, micro robots can deliver therapies with unprecedented precision, improving treatment efficacy and patient outcomes. Additionally, their small size allows for minimally invasive procedures, reducing recovery times and enhancing patient comfort. Overall, micro robots represent a groundbreaking technological advancement with the potential to revolutionize healthcare delivery and significantly benefit human well-being.<br />Their small size enables access to intricate anatomical regions for targeted drug delivery, localized treatment, and precise monitoring. Despite challenges like size constraints and navigation complexities, innovative solutions and interdisciplinary collaboration are driving their advancement in improving healthcare outcomes.</p> Ata Jahangir Moshayedi Amir Sohail Khan Mehdi Davari Tahmineh Mokhtari Mehran Emadi Andani Copyright (c) 2024 Ata Jahangir Moshayedi, Amir Sohail Khan, Mehdi Davari, Tahmineh Mokhtari, Mehran Emadi Andani https://creativecommons.org/licenses/by-nc-sa/4.0 2024-04-12 2024-04-12 3 10.4108/airo.5602 An Overview of OpenAI's Sora and Its Potential for Physics Engine Free Games and Virtual Reality https://publications.eai.eu/index.php/airo/article/view/5273 <p>Sora, OpenAI's latest text-to-video model, is particularly skilled at understanding the physical world, and all of the content it generates mostly consistent with the laws of physics. This indicates that Sora already has the beginnings of a world model and has the potential to become an excellent physics engine in the near future. This paper analyses and explains in detail the potential applications of Sora in physics engines and virtual reality. In addition, its advantages and disadvantages over traditional physics engines are compared based on its unique behavioural characteristics. Finally, it looks forward to the application of Sora in other fields.</p> Zuyan Chen Shuai Li Md. Asraful Haque Copyright (c) 2024 Zuyan Chen, Shuai Li, Md. Asraful Haque https://creativecommons.org/licenses/by-nc-sa/4.0 2024-03-06 2024-03-06 3 10.4108/airo.5273