عنوان فارسی مقاله:ادغام شبکه مولد تخاصمی برای تصمیم گیری در یادگیری تقویتی برای عوامل ربات صنعتی

چکیده

بسیاری از سیستم‌های رباتیکی که محموله‌های خاصی را حمل می‌کنند در صنایع تولیدی برای کارهای انتخاب و مکان به کار می‌روند. اگر وزن کم یا زیاد معرفی شود، سیستم دچار ناکارآمدی می شود. اگر بار متفاوتی معرفی شود (به دلیل تغییر بار یا تغییر در پارامترهای سیستم ربات)، ربات باید با وزن/پارامترهای جدید مجدداً آموزش داده شود و شبکه جدید باید آموزش داده شود. پارامترهایی مانند وزن ربات، طول اندام یا محموله جدید ممکن است برای یک عامل بسته به شرایط متفاوت باشد. تغییرات پارامتر برای عامل در دستیابی به همان هدفی که انتظار می رود با پارامترهای اصلی به آن دست یابد، مشکل ایجاد می کند. از این رو، آموزش مجدد عامل با پارامترهای جدید برای رسیدن به هدف خود الزامی می شود. این تحقیق چارچوب جدیدی را برای انطباق شرایط مختلف بر روی یک عامل ربات در یک محیط شبیه‌سازی شده بدون هیچ گونه آموزش مجدد پیشنهاد می‌کند. با استفاده از ویژگی‌های شبکه مولد تخاصمی (GAN)، عامل می‌تواند تنها یک بار با یادگیری تقویتی تمرین کند و با تغییر بردار نویز مولد در شبکه GAN، عامل می‌تواند متناسب با شرایط جدید وفق دهد و عملکرد مشابهی را نشان دهد. آنها با ویژگی های فیزیکی جدید با استفاده از یادگیری تقویتی آموزش داده شدند. یک محیط CartPole ساده برای آزمایش در نظر گرفته شده است، و نشان داده شده است که با رویکرد پیشنهادی، عامل برای تکرارهای بیشتر پایدار می ماند. این رویکرد را می توان در آینده به دنیای واقعی نیز تعمیم داد.

ادامه مطلب

راهنمای خرید:

لینک دانلود فایل بلافاصله بعد از پرداخت وجه به نمایش در خواهد آمد.
همچنین لینک دانلود به ایمیل شما ارسال خواهد شد به همین دلیل ایمیل خود را به دقت وارد نمایید.
ممکن است ایمیل ارسالی به پوشه اسپم یا Bulk ایمیل شما ارسال شده باشد.
در صورتی که به هر دلیلی موفق به دانلود فایل مورد نظر نشدید با ما تماس بگیرید.

Title: Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents

Abstract

Many robotics systems carrying certain payloads are employed in manufacturing industries for pick and place tasks. The system experiences inefficiency if more or less weight is introduced. If a different payload is introduced (either due to a change in the load or a change in the parameters of the robot system), the robot must be re-trained with the new weight/parameters and the new network must be trained. Parameters such as the robot weight, length of limbs, or new payload may vary for an agent depending on the circumstance. Parameter changes pose a problem to the agent in achieving the same goal it is expected to achieve with the original parameters. Hence, it becomes mandatory to re-train the agent with the new parameters in order for it to achieve its goal. This research proposes a novel framework for the adaption of varying conditions on a robot agent in a given simulated environment without any retraining. Utilizing the properties of Generative Adversarial Network (GAN), the agent is able to train only once with reinforcement learning and by tweaking the noise vector of the generator in the GAN network, the agent can adapt to new conditions accordingly and demonstrate similar performance as if it were trained with the new physical attributes using reinforcement learning. A simple CartPole environment is considered for the experimentation, and it is shown that with the propose approached the agent remains stable for more iterations. The approach can be extended to the real world in the future.