# General description for nonequilibrium steady states in periodically driven dissipative quantum systems

General description for nonequilibrium steady states in periodically driven dissipative quantum systems ( http://arxiv.org/abs/2003.02876v2 )

Tatsuhiko N. Ikeda, Masahiro Sato

Laser technology has developed and accelerated photo-induced nonequilibrium physics from both scientific and engineering viewpoints. Floquet engineering, i.e., controlling material properties and functionalities by time-periodic drives, is at the forefront of the quantum physics of light-matter interaction, but it has been limited to ideal dissipationless systems. To extend Floquet engineering to a variety of materials, it is vital to understand the quantum states emerging from a balance between the periodic drive and energy dissipation. Here we derive the general description for nonequilibrium steady states (NESS) in periodically driven dissipative systems by focusing on systems under high-frequency drive and time-independent Lindblad-type dissipation with the detailed balance condition. Our formula correctly describes the time-average, fluctuation, and symmetry properties of the NESS, and can be computed efficiently in numerical calculations. Our approach will play a fundamental role in Floquet engineering in a broad class of dissipative quantum systems such as atoms and molecules, mesoscopic systems, and condensed matter.
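
For orientation, the setting named in the abstract (a time-periodic Hamiltonian plus time-independent Lindblad dissipation obeying detailed balance) can be written in a generic textbook form. The symbols $H(t)$, $T$, $L_{ij}$, $\gamma_{i\to j}$, $E_i$, and $\beta$ are notation assumed here for illustration, with $\hbar = 1$ and $|E_i\rangle$ taken to be eigenstates of the undriven Hamiltonian; this sketches the class of models and is not the authors' NESS formula.

```latex
\begin{align}
\frac{d\rho}{dt} &= -i\,[H(t),\rho]
  + \sum_{i\neq j}\gamma_{i\to j}\left(L_{ij}\,\rho\,L_{ij}^{\dagger}
  - \tfrac{1}{2}\{L_{ij}^{\dagger}L_{ij},\rho\}\right),
  \qquad H(t+T)=H(t),\\
L_{ij} &= |E_j\rangle\langle E_i|,
  \qquad \frac{\gamma_{i\to j}}{\gamma_{j\to i}} = e^{-\beta\,(E_j-E_i)} .
\end{align}
```

Under these assumptions, switching the drive off leaves the Gibbs state $\rho \propto e^{-\beta H_0}$ stationary, which is the sense in which the dissipation alone would thermalize the system.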
Translated: 2023-05-30 11:26:40 Published: 2020-07-04
# Ramsey interferometry of non-Hermitian quantum impurities

Ramsey interferometry of non-Hermitian quantum impurities ( http://arxiv.org/abs/2003.07378v2 )

F. Tonielli, N. Chakraborty, F. Grusdt, J. Marino

We introduce a Ramsey pulse scheme which extracts the non-Hermitian Hamiltonian associated with an arbitrary Lindblad dynamics. We propose a related protocol to measure via interferometry a generalised Loschmidt echo of a generic state evolving in time with the non-Hermitian Hamiltonian itself, and we apply the scheme to a one-dimensional weakly interacting Bose gas coupled to a stochastic atomic impurity. The Loschmidt echo is mapped into a functional integral from which we calculate the long-time decohering dynamics at arbitrary impurity strengths. For strong dissipation we uncover the phenomenology of a quantum many-body Zeno effect: corrections to the decoherence exponent resulting from the impurity self-energy become purely imaginary, in contrast to the regime of small dissipation where they instead enhance the decay of quantum coherences. Our results illustrate the prospects for experiments employing Ramsey interferometry to study dissipative quantum impurities in condensed matter and cold-atom systems.
Translated: 2023-05-29 00:14:13 Published: 2020-07-04
# Quantum simulations with complex geometries and synthetic gauge fields in a trapped ion chain

Quantum simulations with complex geometries and synthetic gauge fields in a trapped ion chain ( http://arxiv.org/abs/2007.02139v1 )

Tom Manovitz and Yotam Shapira and Nitzan Akerman and Ady Stern and Roee Ozeri

In recent years, arrays of atomic ions in a linear RF trap have proven to be a particularly successful platform for quantum simulation. However, a wide range of quantum models and phenomena have, so far, remained beyond the reach of such simulators. In this work we introduce a technique that can substantially extend this reach using an external field gradient along the ion chain and a global, uniform driving field. The technique can be used to generate both static and time-varying synthetic gauge fields in a linear chain of trapped ions, and enables continuous simulation of a variety of coupling geometries and topologies, including periodic boundary conditions and high dimensional Hamiltonians. We describe the technique, derive the corresponding effective Hamiltonian, propose a number of variations, and discuss the possibility of scaling to quantum-advantage sized simulators. Additionally, we suggest several possible implementations and briefly examine two: the Aharonov-Bohm ring and the frustrated triangular ladder.
Translated: 2023-05-11 08:16:43 Published: 2020-07-04
# Electronic Properties of Graphene Quantum Ring with Wedge Disclination

Electronic Properties of Graphene Quantum Ring with Wedge Disclination ( http://arxiv.org/abs/2007.02138v1 )

Abdelhadi Belouad, Ahmed Jellal, Hocine Bahlouli

We study the energy spectrum and persistent current of charge carriers confined in a graphene quantum ring geometry of radius $R$ and width $w$ subjected to a magnetic flux. We consider the case where the crystal symmetry is locally modified by replacing a hexagon with a pentagon, square, heptagon or octagon. To model this type of defect we include appropriate boundary conditions for the angular coordinate. The electrons are confined to a finite-width strip in the radial direction by setting infinite mass boundary conditions at the edges of the strip. The solutions are expressed in terms of Hankel functions, and their asymptotic behavior allows us to derive quantized energy levels in the presence of an energy gap. We also investigate the persistent currents that appear in the quantum ring and how wedge disclination influences different quantum transport quantities.
Translated: 2023-05-11 08:16:26 Published: 2020-07-04
# A Modern Non-SQL Approach to Radiology-Centric Search Engine Design with Clinical Validation

A Modern Non-SQL Approach to Radiology-Centric Search Engine Design with Clinical Validation ( http://arxiv.org/abs/2007.02124v1 )

Healthcare data is increasing in size at an unprecedented speed with much attention on big data analysis and Artificial Intelligence application for quality assurance, clinical training, severity triaging, and decision support. Radiology is well-suited for innovation given its intrinsically paired linguistic and visual data. Previous attempts to unlock this information goldmine were encumbered by heterogeneity of human language, proprietary search algorithms, and lack of medicine-specific search performance matrices. We present a de novo process of developing a document-based, secure, efficient, and accurate search engine in the context of Radiology. We assess our implementation of the search engine with comparison to pre-existing manually collected clinical databases used previously for clinical research projects in addition to computational performance benchmarks and survey feedback. By leveraging efficient database architecture, search capability, and clinical thinking, radiologists are at the forefront of harnessing the power of healthcare data.
Translated: 2023-05-11 08:16:09 Published: 2020-07-04
# Healing Spaces: Feasibility of a Multisensory Experience for Older Adults with Advanced Dementia and their Caregivers

Healing Spaces: Feasibility of a Multisensory Experience for Older Adults with Advanced Dementia and their Caregivers ( http://arxiv.org/abs/2007.02083v1 )

Gabriela Purri R. Gomes, Sydney Rubin, Leah I. Stein Duker, Donna Benton, Andreas Kratky, Sze Yu A Chen, Maryalice Jordan-Marsh, and Marientina Gotsis

Healing Spaces proposes a new approach to multisensory interventions that show potential in ameliorating the behavioral and psychological symptoms of advanced dementia in older adults. Using smart technology, the project combines both digital and physical components to transform spaces and create unified, curated sensory experiences that provide meaningful context for interaction, and are easy for caregivers to deliver. A usability study was conducted for the Healing Spaces app followed by a feasibility evaluation of the full experience in a memory care facility recruiting caregivers, and residents in advanced stages of dementia. The feasibility evaluation successfully illuminated strengths as well as areas for improvement for the Healing Spaces experience in a memory care setting with older adults with advanced dementia. Caregivers and facility managers expressed interest in continuing to use Healing Spaces with the residents of the facility. Lessons learned about the technical and logistical implementation of Healing Spaces are discussed, as well as future directions for study design and potential therapeutic value of the experience.
Translated: 2023-05-11 08:15:38 Published: 2020-07-04
# Single-Atom Verification of the Information-Theoretical Bound of Irreversibility at the Quantum Level

Single-Atom Verification of the Information-Theoretical Bound of Irreversibility at the Quantum Level ( http://arxiv.org/abs/2007.02027v1 )

J. W. Zhang, K. Rehan, M. Li, J. C. Li, L. Chen, S.-L. Su, L.-L. Yan, F. Zhou and M. Feng

A quantitative measure of disorder or randomness based on the entropy production characterizes thermodynamical irreversibility, which is relevant to the conventional second law of thermodynamics. Here we report, in a quantum mechanical fashion, the first theoretical prediction and experimental exploration of an information-theoretical bound on the entropy production. Our theoretical model consists of the simplest two-level dissipative system driven by a purely classical field, and under Markovian dissipation, we find that such an information-theoretical bound, not fully validating quantum relaxation processes, strongly depends on the drive-to-decay ratio and the initial state. Furthermore, we carry out experimental verification of this information-theoretical bound by means of a single spin embedded in an ultracold trapped $^{40}$Ca$^{+}$ ion. Our finding, based on a two-level model, is fundamental to any quantum thermodynamical process and indicates considerable difference and complexity in quantum thermodynamics with respect to its conventional classical counterpart.
Translated: 2023-05-11 08:14:42 Published: 2020-07-04
# Stable Polarization Entanglement based Quantum Key Distribution over Metropolitan Fibre Network

Stable Polarization Entanglement based Quantum Key Distribution over Metropolitan Fibre Network ( http://arxiv.org/abs/2007.01989v1 )

We demonstrate a quantum key distribution implementation over deployed dark telecom fibers with polarisation-entangled photons generated at the O-band. One photon of each pair is propagated through 10 km of deployed fiber while the other is detected locally. Polarisation drifts experienced by the photons propagating through the fibers are compensated with liquid crystal variable retarders. This ensures continuous and stable QKD operation with an average QBER of 6.4% and a final key rate of 109 bits/s.
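
To give a feel for what a QBER of 6.4% means for key extraction, the sketch below evaluates a standard asymptotic bound for BB84-type entanglement-based protocols, $r = 1 - 2h(Q)$, where $h$ is the binary entropy. This bound, the function names, and the assumption of ideal error correction are illustrative choices made here; the abstract does not say how the reported 109 bits/s was derived, so this is not the authors' key-rate analysis.

```python
import math


def binary_entropy(q):
    """Shannon binary entropy h(q) in bits, with h(0) = h(1) = 0."""
    if q <= 0.0 or q >= 1.0:
        return 0.0
    return -q * math.log2(q) - (1.0 - q) * math.log2(1.0 - q)


def asymptotic_key_fraction(qber):
    """Secret bits per sifted bit under the assumed bound 1 - 2*h(QBER)."""
    return 1.0 - 2.0 * binary_entropy(qber)


if __name__ == "__main__":
    # QBER value quoted in the abstract
    print(f"{asymptotic_key_fraction(0.064):.3f}")
```

Under this assumed bound, a QBER of 6.4% leaves roughly 0.31 secret bits per sifted bit, and the fraction falls to zero near a QBER of 11%.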
Translated: 2023-05-11 08:14:17 Published: 2020-07-04
# Characterizing Online Vandalism: A Rational Choice Perspective

Characterizing Online Vandalism: A Rational Choice Perspective ( http://arxiv.org/abs/2007.02199v1 )

What factors influence the decision to vandalize? Although the harm is clear, the benefit to the vandal is less clear. In many cases, the thing being damaged may itself be something the vandal uses or enjoys. Vandalism holds communicative value: perhaps to the vandal themselves, to some audience at whom the vandalism is aimed, and to the general public. Viewing vandals as rational community participants despite their antinormative behavior offers the possibility of engaging with or countering their choices in novel ways. Rational choice theory (RCT) as applied in value expectancy theory (VET) offers a strategy for characterizing behaviors in a framework of rational choices, and begins with the supposition that subject to some weighting of personal preferences and constraints, individuals maximize their own utility by committing acts of vandalism. This study applies the framework of RCT and VET to gain insight into vandals' preferences and constraints. Using a mixed-methods analysis of Wikipedia, I combine social computing and criminological perspectives on vandalism to propose an ontology of vandalism for online content communities. I use this ontology to categorize 141 instances of vandalism and find that the character of vandalistic acts varies by vandals' relative identifiability, policy history with Wikipedia, and the effort required to vandalize.
Translated: 2023-05-11 08:06:36 Published: 2020-07-04
# Induced interactions and quench dynamics of bosonic impurities immersed in a Fermi sea

Induced interactions and quench dynamics of bosonic impurities immersed in a Fermi sea ( http://arxiv.org/abs/2007.02166v1 )

K. Mukherjee, S. I. Mistakidis, S. Majumder and P. Schmelcher

We unravel the ground state properties and the non-equilibrium quantum dynamics of two bosonic impurities immersed in a one-dimensional fermionic environment by applying a quench of the impurity-medium interaction strength. In the ground state, the impurities and the Fermi sea are phase-separated for strong impurity-medium repulsions, while they experience a localization tendency around the trap center for large attractions. We demonstrate the presence of attractive induced interactions mediated by the host for impurity-medium couplings of either sign and analyze the competition between induced and direct interactions. A quench to repulsive interactions triggers a breathing motion in both components, with an interaction-dependent frequency and amplitude for the impurities, and a dynamical phase separation between the impurities and their surroundings for strong repulsions. For attractive post-quench couplings, a beating pattern that owes its existence to the dominant role of induced interactions takes place, with both components showing a localization trend around the trap center. In both quench scenarios, attractive induced correlations are manifested between non-interacting impurities and are found to dominate the direct ones only for quenches to attractive couplings.
Translated: 2023-05-11 08:05:44 Published: 2020-07-04
# Scalable Role-based Access Control Using The EOS Blockchain

Scalable Role-based Access Control Using The EOS Blockchain ( http://arxiv.org/abs/2007.02163v1 )

Role-based access control (RBAC) policies represent the rights of subjects in terms of roles to access resources. This research proposes a scalable, flexible and auditable RBAC system using the EOS blockchain platform to meet the security requirements of organizations. The EOS blockchain platform for developing smart contract and decentralized applications (DAPPs) aims to address the scalability problem found in existing blockchain platforms. This smart contract platform aims to eliminate transaction fees while conducting millions of transactions per second. In our proposed approach, the EOS blockchain transparently stores RBAC policies. Administrative roles control access to resources at a higher level according to the way organisations perform operations. An organisation creates roles, role hierarchies and constraints to regulate user actions. Therefore, once an RBAC framework is established, the administrative user (issuer) only needs to grant and revoke roles to support changes in the organisational structure. Our proposed blockchain-based RBAC supports delegation capabilities using gaseless transactions which makes it adoptable and appealing in a large number of application scenarios. Our proposed solution is application-agnostic and well-suited for diverse use cases. Existing state-of-the art security frameworks are not suitable due to the difficulty of scale, higher cost and single point of failure. Consequently, organisations demand a scalable, cost-effective and lightweight access control solution which can better protect their privacy as well. A proof of concept implementation is developed based on the EOS blockchain. Our experimental results and analysis clearly show that our EOS blockchain-based RBAC outperforms existing blockchain platforms in terms of cost, latency, block generation time, contract execution time and throughput.
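
The RBAC notions used in the abstract (roles, role hierarchies, and an issuer who grants and revokes roles) can be illustrated with a small in-memory sketch. It does not touch EOS or any smart-contract API; the class `RBAC` and all method names are hypothetical and chosen only for illustration, and the on-chain storage, constraints, and delegation described in the abstract are left out.

```python
class RBAC:
    """Toy role-based access control with a role hierarchy (illustrative only)."""

    def __init__(self):
        self.role_perms = {}    # role -> set of permissions
        self.role_juniors = {}  # senior role -> set of roles it inherits from
        self.user_roles = {}    # user -> set of directly granted roles

    def add_permission(self, role, permission):
        self.role_perms.setdefault(role, set()).add(permission)

    def add_inheritance(self, senior, junior):
        # A senior role inherits every permission of its junior roles.
        self.role_juniors.setdefault(senior, set()).add(junior)

    def grant(self, user, role):
        self.user_roles.setdefault(user, set()).add(role)

    def revoke(self, user, role):
        self.user_roles.get(user, set()).discard(role)

    def effective_roles(self, user):
        # Depth-first walk over the hierarchy starting from the granted roles.
        seen = set()
        stack = list(self.user_roles.get(user, ()))
        while stack:
            role = stack.pop()
            if role not in seen:
                seen.add(role)
                stack.extend(self.role_juniors.get(role, ()))
        return seen

    def can(self, user, permission):
        return any(
            permission in self.role_perms.get(role, ())
            for role in self.effective_roles(user)
        )
```

Once the roles and hierarchy exist, a change in the organisation only needs `grant` or `revoke` calls, which is the administrative simplification the abstract points to.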
Translated: 2023-05-11 08:05:22 Published: 2020-07-04
# Investigating the Requirements for Building a Blockchain-Based Achievement Record System

Investigating the Requirements for Building a Blockchain-Based Achievement Record System ( http://arxiv.org/abs/2007.02162v1 )

A trusted achievement record is a secure system that aims to record and authenticate certificates as well as key learning activities and achievements. This paper intends to gather important information on the thoughts and outlooks of stakeholders on an achievement record system that uses blockchain and smart contract technology. The system would allow stakeholders (for example employers) to validate learning records. Two main aims are investigated. The first is to evaluate the suitability of the idea of building a trusted achievement record for learners in higher education, and to evaluate potential user knowledge of blockchain technology. This is to ensure that a designed system is usable. The second aim includes an interview conducted with a small group of participants to gather information about the challenges individuals have when creating, and reviewing CVs. Overall, 90% of participants agreed that there was a strong need for a trusted achievement record. In addition, 93.64% of respondents stated that they felt it was invaluable to have a system that is usable by all stakeholders. When tackling the second aim it was found that a primary challenge is lack of knowledge of blockchain and its complexity. From the employers' perspective, there is a lack of trust due to inaccuracies when students describe skills and qualifications in their resumes.
Translated: 2023-05-11 08:04:59 Published: 2020-07-04
# Blockchain-Based Trusted Achievement Record System Design

Blockchain-Based Trusted Achievement Record System Design ( http://arxiv.org/abs/2007.02161v1 )

The primary purpose of this paper is to provide a design of a blockchain-based system, which produces a verifiable record of achievements. Such a system has a wide range of potential benefits for students, employers and higher education institutions. A verifiable record of achievements enables students to present academic accomplishments to employers, within a trusted framework. Furthermore, the availability of such a record system would enable students to review their learning throughout their career, giving them a platform on which to plan for their future accomplishments, both individually and with support from other parties (for example, academic advisors, supervisors, or potential employers). The proposed system will help students in universities to increase their extra-curricular activities and improve non-academic skills. Moreover, the system will facilitate communication between industry, students, and universities for employment purposes and simplify the search for the most appropriate potential employees for the job.
Translated: 2023-05-11 08:04:41 Published: 2020-07-04
# Generation and dynamics of entangled fermion-photon-phonon states in nanocavities

Generation and dynamics of entangled fermion-photon-phonon states in nanocavities ( http://arxiv.org/abs/2007.02159v1 )

Mikhail Tokman, Maria Erukhimova, Yongrui Wang, Qianfan Chen, Alexey Belyanin

We develop the analytic theory describing the formation and evolution of entangled quantum states for a fermionic quantum emitter coupled to a quantized electromagnetic field in a nanocavity and quantized phonon or mechanical vibrational modes. The theory is applicable to a broad range of cavity quantum optomechanics problems and emerging research on plasmonic nanocavities coupled to single molecules and other quantum emitters. The optimal conditions for a tri-state entanglement are realized near the parametric resonances in a coupled system. The model includes decoherence effects due to coupling of the fermion, photon, and phonon subsystems to their dissipative reservoirs within the stochastic evolution approach, which is derived from the Heisenberg-Langevin formalism. Our theory provides analytic expressions for the time evolution of the quantum state and observables, and the emission spectra. The limit of a classical acoustic pumping and the interplay between parametric and standard one-photon resonances are analyzed.
Translated: 2023-05-11 08:04:26 Published: 2020-07-04
# Classical and quantum analysis of the parametric oscillator with Morse-like time-dependent frequency

Classical and quantum analysis of the parametric oscillator with Morse-like time-dependent frequency ( http://arxiv.org/abs/2007.02150v1 )

Mariagiovanna Gianfreda and Giulio Landolfi

We consider the problem of understanding the basic features displayed by quantum systems described by parametric oscillators whose time-dependent frequency parameter $\omega(t)$ varies during evolution so as to display either a nonharmonic hole or barrier. To this end we focus on the case where $\omega(t)^2$ behaves like a Morse potential, up to possible sign reversal and translations in the $(t,\omega^2)$ plane. We derive a closed-form solution for the time-dependent amplitude of quasi-normal modes, which is known to be the fundamental dynamical object entering the description of both the classical and quantum dynamics of time-dependent quadratic systems. Once this quantity is determined and its significant characteristics highlighted, we provide more refined insight into the way quantum states evolve by paying attention to the position-momentum Heisenberg uncertainty principle and the statistical aspects implied by second-order correlation functions over number-type states.
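
As a reading aid, the standard relations behind a "time-dependent amplitude" for a parametric oscillator can be written out as follows. The amplitude-phase decomposition and the resulting Ermakov-type equation are textbook results for unit mass; the symbols $\sigma$, $\theta$, $\omega_0$, $D$, $a$, and $t_0$, and in particular the Morse-like profile in the last line, are notation assumed here for illustration, and the paper's own parametrization may differ.

```latex
\begin{align}
\ddot{x}(t) + \omega^{2}(t)\,x(t) &= 0, \\
x(t) = \sigma(t)\,e^{\pm i\theta(t)}, \quad
\dot{\theta}(t) = \frac{1}{\sigma^{2}(t)}
\;&\Longrightarrow\;
\ddot{\sigma}(t) + \omega^{2}(t)\,\sigma(t) = \frac{1}{\sigma^{3}(t)}, \\
\omega^{2}(t) &= \omega_{0}^{2}
  + D\left(e^{-2a(t-t_{0})} - 2\,e^{-a(t-t_{0})}\right).
\end{align}
```

Substituting the ansatz into the first line, the imaginary part vanishes identically because $\ddot{\theta} = -2\dot{\sigma}/\sigma^{3}$, and the real part gives the nonlinear amplitude equation in the second line.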
Translated: 2023-05-11 08:04:09 Published: 2020-07-04
# Regularization via Structural Label Smoothing

Regularization via Structural Label Smoothing ( http://arxiv.org/abs/2001.01900v2 )

Weizhi Li, Gautam Dasarathy and Visar Berisha

Regularization is an effective way to promote the generalization performance of machine learning models. In this paper, we focus on label smoothing, a form of output distribution regularization that prevents overfitting of a neural network by softening the ground-truth labels in the training data in an attempt to penalize overconfident outputs. Existing approaches typically use cross-validation to impose this smoothing, which is uniform across all training data. In this paper, we show that such label smoothing imposes a quantifiable bias in the Bayes error rate of the training data, with regions of the feature space with high overlap and low marginal likelihood having a lower bias and regions of low overlap and high marginal likelihood having a higher bias. These theoretical results motivate a simple objective function for data-dependent smoothing to mitigate the potential negative consequences of the operation while maintaining its desirable properties as a regularizer. We call this approach Structural Label Smoothing (SLS). We implement SLS and empirically validate on synthetic, Higgs, SVHN, CIFAR-10, and CIFAR-100 datasets. The results confirm our theoretical insights and demonstrate the effectiveness of the proposed method in comparison to traditional label smoothing.
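
The difference between uniform and data-dependent label smoothing can be shown in a few lines. The sketch below implements the usual smoothed target $(1-\alpha)\,\mathrm{onehot}(y) + \alpha/K$ and then lets $\alpha$ vary per sample. How SLS actually chooses the per-sample strengths is not given in the abstract, so the `alphas` argument is simply taken as an input here, and both function names are hypothetical.

```python
def smooth_label(label, num_classes, alpha):
    """Return (1 - alpha) * one_hot(label) + alpha / num_classes as a list."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    base = alpha / num_classes
    return [
        base + (1.0 - alpha if k == label else 0.0)
        for k in range(num_classes)
    ]


def smooth_labels_per_sample(labels, num_classes, alphas):
    """Data-dependent variant: each sample gets its own smoothing strength."""
    if len(labels) != len(alphas):
        raise ValueError("labels and alphas must have the same length")
    return [
        smooth_label(y, num_classes, a)
        for y, a in zip(labels, alphas)
    ]
```

Uniform smoothing is the special case in which every entry of `alphas` is the same; a data-dependent scheme would assign different values to samples from different regions of the feature space.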
Translated: 2023-01-13 20:17:48 Published: 2020-07-04
# Fairness by Learning Orthogonal Disentangled Representations

Fairness by Learning Orthogonal Disentangled Representations ( http://arxiv.org/abs/2003.05707v3 )

Mhd Hasan Sarhan, Nassir Navab, Abouzar Eslami, Shadi Albarqouni

Learning powerful discriminative representations is a crucial step for machine learning systems. Introducing invariance against arbitrary nuisance or sensitive attributes while performing well on specific tasks is an important problem in representation learning. This is mostly approached by purging the sensitive information from learned representations. In this paper, we propose a novel disentanglement approach to the invariant representation problem. We disentangle the meaningful and sensitive representations by enforcing orthogonality constraints as a proxy for independence. We explicitly enforce the meaningful representation to be agnostic to sensitive information by entropy maximization. The proposed approach is evaluated on five publicly available datasets and compared with state-of-the-art methods for learning fairness and invariance, achieving state-of-the-art performance on three datasets and comparable performance on the rest. Further, we perform an ablation study to evaluate the effect of each component.
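
One simple way to express "orthogonality as a proxy for independence" is a penalty on the squared cosine similarity between the two representations of each sample. The sketch below is an assumption made for illustration: the abstract does not specify where or how the authors impose the constraint, and the function names are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def orthogonality_penalty(z_target, z_sensitive):
    """Mean squared cosine between paired target and sensitive representations.

    The value is 0 when every pair is orthogonal and 1 when every pair is
    parallel or anti-parallel.
    """
    pairs = list(zip(z_target, z_sensitive))
    return sum(cosine(t, s) ** 2 for t, s in pairs) / len(pairs)
```

Adding such a term to a training loss pushes the two representations toward orthogonal directions; orthogonality does not by itself guarantee statistical independence, which is why the abstract describes it as a proxy.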
Translated: 2022-12-24 14:05:20 Published: 2020-07-04
# Deep 3D Capture: Geometry and Reflectance from Sparse Multi-View Images

Deep 3D Capture: Geometry and Reflectance from Sparse Multi-View Images ( http://arxiv.org/abs/2003.12642v2 )

Sai Bi, Zexiang Xu, Kalyan Sunkavalli, David Kriegman, Ravi Ramamoorthi

We introduce a novel learning-based method to reconstruct the high-quality geometry and complex, spatially-varying BRDF of an arbitrary object from a sparse set of only six images captured by wide-baseline cameras under collocated point lighting. We first estimate per-view depth maps using a deep multi-view stereo network; these depth maps are used to coarsely align the different views. We propose a novel multi-view reflectance estimation network architecture that is trained to pool features from these coarsely aligned images and predict per-view spatially-varying diffuse albedo, surface normals, specular roughness and specular albedo. We do this by jointly optimizing the latent space of our multi-view reflectance network to minimize the photometric error between images rendered with our predictions and the input images. While previous state-of-the-art methods fail on such sparse acquisition setups, we demonstrate, via extensive experiments on synthetic and real data, that our method produces high-quality reconstructions that can be used to render photorealistic images.
Translated: 2022-12-19 05:30:02 Published: 2020-07-04
# Deep Fashion3D: A Dataset and Benchmark for 3D Garment Reconstruction from Single Images

Deep Fashion3D: A Dataset and Benchmark for 3D Garment Reconstruction from Single Images ( http://arxiv.org/abs/2003.12753v2 )

Heming Zhu, Yu Cao, Hang Jin, Weikai Chen, Dong Du, Zhangye Wang, Shuguang Cui, Xiaoguang Han

High-fidelity clothing reconstruction is the key to achieving photorealism in a wide range of applications including human digitization, virtual try-on, etc. Recent advances in learning-based approaches have accomplished unprecedented accuracy in recovering unclothed human shape and pose from single images, thanks to the availability of powerful statistical models, e.g. SMPL, learned from a large number of body scans. In contrast, modeling and recovering clothed humans and 3D garments remains notoriously difficult, mostly due to the lack of large-scale clothing models available to the research community. We propose to fill this gap by introducing Deep Fashion3D, the largest collection to date of 3D garment models, with the goal of establishing a novel benchmark and dataset for the evaluation of image-based garment reconstruction systems. Deep Fashion3D contains 2078 models reconstructed from real garments, which covers 10 different categories and 563 garment instances. It provides rich annotations including 3D feature lines, 3D body pose and the corresponding multi-view real images. In addition, each garment is randomly posed to enhance the variety of real clothing deformations. To demonstrate the advantage of Deep Fashion3D, we propose a novel baseline approach for single-view garment reconstruction, which leverages the merits of both mesh and implicit representations. A novel adaptable template is proposed to enable the learning of all types of clothing in a single network. Extensive experiments have been conducted on the proposed dataset to verify its significance and usefulness. We will make Deep Fashion3D publicly available upon publication.
翻訳日:2022-12-18 23:45:32 公開日:2020-07-04
# l$^2$-gcn:階層的学習とグラフ畳み込みネットワークの効率的な学習

L$^2$-GCN: Layer-Wise and Learned Efficient Training of Graph Convolutional Networks ( http://arxiv.org/abs/2003.13606v11 )

Yuning You, Tianlong Chen, Zhangyang Wang, Yang Shen(参考訳) グラフ畳み込みネットワーク(GCN)は、多くのアプリケーションで人気が高まりつつあるが、大きなグラフデータセットをトレーニングするのは難しい。 隣人から再帰的にノード表現を計算する必要がある。 現在のGCNトレーニングアルゴリズムは、レイヤーの数で指数関数的に増加する高い計算コストや、グラフ全体とノードの埋め込みをロードするメモリ使用量に悩まされている。 本稿では,GCN(L-GCN)のための新しいレイヤワイドトレーニングフレームワークを提案する。 我々はL-GCNをグラフ同型フレームワークで理論的に解析し、L-GCNはよりコストのかかる訓練アルゴリズムほど強力なGCNをもたらす。 さらにL$^2$-GCNを提案し、L-GCNにおける各層毎のトレーニングエポックを自動的に調整できる各層のコントローラを学習する。 実験の結果、L-GCNは少なくとも1桁の精度で最新技術よりも高速であり、メモリ使用量の一貫性はデータセットのサイズに依存しない。 学習したコントローラでは、L$^2$-GCNはトレーニング時間を半減することができる。 私たちのコードはhttps://github.com/shen-lab/l2-gcnで利用可能です。

Graph convolutional networks (GCNs) are increasingly popular in many applications, yet remain notoriously hard to train over large graph datasets. They need to compute node representations recursively from their neighbors. Current GCN training algorithms suffer from either high computational costs that grow exponentially with the number of layers, or high memory usage for loading the entire graph and node embeddings. In this paper, we propose a novel efficient layer-wise training framework for GCN (L-GCN) that disentangles feature aggregation and feature transformation during training, hence greatly reducing time and memory complexities. We present a theoretical analysis of L-GCN under the graph isomorphism framework, showing that L-GCN leads to GCNs as powerful as those produced by the more costly conventional training algorithm, under mild conditions. We further propose L$^2$-GCN, which learns a controller for each layer that can automatically adjust the training epochs per layer in L-GCN. Experiments show that L-GCN is faster than state-of-the-art methods by at least an order of magnitude, with consistent memory usage that does not depend on dataset size, while maintaining comparable prediction performance. With the learned controller, L$^2$-GCN can further cut the training time in half. Our code is available at https://github.com/Shen-Lab/L2-GCN.
翻訳日:2022-12-18 06:50:16 公開日:2020-07-04
# 多頭部注意型確率的車両軌道予測

Multi-Head Attention based Probabilistic Vehicle Trajectory Prediction ( http://arxiv.org/abs/2004.03842v3 )

ライセンス: Link先を確認
This paper presents an online-capable deep learning model for probabilistic vehicle trajectory prediction. We propose a simple encoder-decoder architecture based on multi-head attention. The proposed model generates the distribution of the predicted trajectories for multiple vehicles in parallel. Our approach to modeling the interactions can learn to attend to a few influential vehicles in an unsupervised manner, which can improve the interpretability of the network. Experiments using naturalistic highway trajectories show a clear improvement in positional error in both the longitudinal and lateral directions.
翻訳日:2022-12-15 08:36:27 公開日:2020-07-04
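The attention mechanism mentioned in the abstract above can be illustrated with a minimal sketch. This is generic single-head scaled dot-product attention, not the paper's architecture: the function names `softmax` and `attention_weights`, and the idea of using one query vector for the ego vehicle and one key vector per surrounding vehicle, are assumptions made only for illustration.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(query: Sequence[float], keys: Sequence[Sequence[float]]) -> List[float]:
    """Single-head scaled dot-product attention weights.

    query: feature vector of the vehicle being predicted (hypothetical encoding).
    keys:  one feature vector per surrounding vehicle (hypothetical encoding).
    Returns one non-negative weight per key; the weights sum to 1.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    return softmax(scores)
```

A multi-head variant would run several such heads on different learned projections and concatenate the results; a large weight on one key corresponds to the model attending to that vehicle.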
# ユーモラスにしよう:知識に富んだHumor Generation

Let's be Humorous: Knowledge Enhanced Humor Generation ( http://arxiv.org/abs/2004.13317v2 )

ライセンス: Link先を確認
The generation of humor is an under-explored and challenging problem. Previous works mainly utilize templates or replace phrases to generate humor. However, few works focus on freer forms and the background knowledge of humor. The linguistic theory of humor defines the structure of a humorous sentence as a set-up and a punchline. In this paper, we explore how to generate a punchline given the set-up and the relevant knowledge. We propose a framework that can fuse the knowledge into end-to-end models. To our knowledge, this is the first attempt to generate punchlines with a knowledge-enhanced model. Furthermore, we create the first humor-knowledge dataset. The experimental results demonstrate that our method can make use of knowledge to generate fluent, funny punchlines, outperforming several baselines.
翻訳日:2022-12-08 23:00:34 公開日:2020-07-04
# ロバストな2次元特異値分解のためのカーネルリスク感性損失の一般化

A Generalized Kernel Risk Sensitive Loss for Robust Two-Dimensional Singular Value Decomposition ( http://arxiv.org/abs/2005.04671v2 )

ライセンス: Link先を確認
Two-dimensional singular value decomposition (2DSVD) has been widely used for image processing tasks, such as image reconstruction, classification, and clustering. However, the traditional 2DSVD algorithm is based on the mean square error (MSE) loss, which is sensitive to outliers. To overcome this problem, we propose a robust 2DSVD framework based on a generalized kernel risk sensitive loss (GKRSL-2DSVD), which is more robust to noise and outliers. Since the proposed objective function is non-convex, a majorization-minimization algorithm is developed to solve it efficiently with guaranteed convergence. The proposed framework has the inherent properties of processing non-centered data, being rotationally invariant, and being easily extended to higher-order spaces. Experimental results on public databases demonstrate that the proposed method significantly outperforms all the benchmarks on different applications.
翻訳日:2022-12-05 01:38:43 公開日:2020-07-04
# もっと悪いが、より良いBLEU? マルチタスク音声翻訳における単語埋め込みの中間的活用

Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation ( http://arxiv.org/abs/2005.10678v2 )

ライセンス: Link先を確認
Speech translation (ST) aims to learn transformations from speech in the source language to the text in the target language. Previous works show that multitask learning improves the ST performance, in which the recognition decoder generates the text of the source language, and the translation decoder obtains the final translations based on the output of the recognition decoder. Because whether the output of the recognition decoder has the correct semantics is more critical than its accuracy, we propose to improve the multitask ST model by utilizing word embedding as the intermediate.
翻訳日:2022-11-30 23:28:37 公開日:2020-07-04
# ディープラーニングによる自動運転:最先端技術に関する調査

Autonomous Driving with Deep Learning: A Survey of State-of-Art Technologies ( http://arxiv.org/abs/2006.06091v3 )

ライセンス: Link先を確認
Since the DARPA Grand Challenges (rural) in 2004/05 and the Urban Challenge in 2007, autonomous driving has been the most active field of AI applications. At almost the same time, deep learning achieved breakthroughs through the work of several pioneers; three of them (also called the fathers of deep learning), Hinton, Bengio and LeCun, won the ACM Turing Award in 2019. This is a survey of autonomous driving technologies with deep learning methods. We investigate the major fields of self-driving systems, such as perception, mapping and localization, prediction, planning and control, simulation, V2X and safety. Due to limited space, we focus the analysis on several key areas, i.e. 2D and 3D object detection in perception, depth estimation from cameras, multi-sensor fusion at the data, feature and task levels, and behavior modelling and prediction of vehicle driving and pedestrian trajectories.
翻訳日:2022-11-23 05:59:00 公開日:2020-07-04
# 運転行動予測における文脈知識の導入に向けて

Towards Incorporating Contextual Knowledge into the Prediction of Driving Behavior ( http://arxiv.org/abs/2006.08470v2 )

ライセンス: Link先を確認
Predicting the behavior of surrounding traffic participants is crucial for advanced driver assistance systems and autonomous driving. Most researchers however do not consider contextual knowledge when predicting vehicle motion. Extending former studies, we investigate how predictions are affected by external conditions. To do so, we categorize different kinds of contextual information and provide a carefully chosen definition as well as examples for external conditions. More precisely, we investigate how a state-of-the-art approach for lateral motion prediction is influenced by one selected external condition, namely the traffic density. Our investigations demonstrate that this kind of information is highly relevant in order to improve the performance of prediction algorithms. Therefore, this study constitutes the first step towards the integration of such information into automated vehicles. Moreover, our motion prediction approach is evaluated based on the public highD data set showing a maneuver prediction performance with areas under the ROC curve above 97% and a median lateral prediction error of only 0.18m on a prediction horizon of 5s.
翻訳日:2022-11-21 02:31:18 公開日:2020-07-04
# スパースニューラルネットワークのトポロジ的考察

Topological Insights into Sparse Neural Networks ( http://arxiv.org/abs/2006.14085v2 )

ライセンス: Link先を確認
Sparse neural networks are an effective approach to reducing the resource requirements for the deployment of deep neural networks. Recently, the concept of adaptive sparse connectivity has emerged to allow training sparse neural networks from scratch by optimizing the sparse structure during training. However, comparing different sparse topologies and determining how sparse topologies evolve during training, especially when sparse structure optimization is involved, remain challenging open questions. This comparison becomes increasingly complex as the number of possible topological comparisons increases exponentially with the size of networks. In this work, we introduce an approach to understand and compare sparse neural network topologies from the perspective of graph theory. We first propose Neural Network Sparse Topology Distance (NNSTD) to measure the distance between different sparse neural networks. Further, we demonstrate that sparse neural networks can outperform over-parameterized models in terms of performance, even without any further structure optimization. Finally, we also show that adaptive sparse connectivity can always unveil a plenitude of sparse sub-networks with very different topologies which outperform the dense model, by quantifying and comparing their topological evolutionary processes. The latter findings complement the Lottery Ticket Hypothesis by showing that there is a much more efficient and robust way to find "winning tickets". Altogether, our results start enabling a better theoretical understanding of sparse neural networks, and demonstrate the utility of using graph theory to analyze them.
翻訳日:2022-11-17 09:31:44 公開日:2020-07-04
# グラフ構造トピックニューラルネットワーク

Graph Structural-topic Neural Network ( http://arxiv.org/abs/2006.14278v2 )

ライセンス: Link先を確認
Graph Convolutional Networks (GCNs) have achieved tremendous success by effectively gathering local features for nodes. However, GCNs commonly focus more on node features and less on graph structures within the neighborhood, especially higher-order structural patterns, even though such local structural patterns have been shown to be indicative of node properties in numerous fields. In addition, it is not just single patterns but the distribution over all these patterns that matters, because networks are complex and the neighborhood of each node consists of a mixture of various nodes and structural patterns. Correspondingly, in this paper, we propose Graph Structural-topic Neural Network, abbreviated GraphSTONE, a GCN model that utilizes topic models of graphs, such that the structural topics capture indicative graph structures broadly from a probabilistic aspect rather than merely a few structures. Specifically, we build topic models upon graphs using anonymous walks and Graph Anchor LDA, an LDA variant that selects significant structural patterns first, so as to alleviate the complexity and generate structural topics efficiently. In addition, we design multi-view GCNs to unify node features and structural topic features and utilize structural topics to guide the aggregation. We evaluate our model through both quantitative and qualitative experiments, where our model exhibits promising performance, high efficiency, and clear interpretability.
翻訳日:2022-11-17 03:41:34 公開日:2020-07-04
# 道路交通流予測のためのグラフモデリング手法

Graph modelling approaches for motorway traffic flow prediction ( http://arxiv.org/abs/2006.14824v2 )

ライセンス: Link先を確認
Traffic flow prediction, particularly in areas that experience highly dynamic flows such as motorways, is a major issue faced in traffic management. Due to the increasingly large volumes of data being generated every minute, deep learning methods have been used extensively in recent years for both short- and long-term prediction. However, such models, despite their efficiency, need large amounts of historical information to be provided, and they take a considerable amount of time and computing resources to train, validate and test. This paper presents two new spatial-temporal approaches for building accurate short-term predictions along a popular motorway in Sydney, by making use of the graph structure of the motorway network (including exits and entries). The methods are built on proximity-based approaches, denoted backtracking and interpolation, which use the most recent and closest traffic flow information for each of the target counting stations along the motorway. The results indicate that for short-term predictions (less than 10 minutes into the future), the proposed graph-based approaches outperform state-of-the-art deep learning models, such as long short-term memory, convolutional neural networks or hybrid models.
翻訳日:2022-11-16 22:08:40 公開日:2020-07-04
# エンコーダ・デコーダを用いたCOVID-19肺感染分画法

An encoder-decoder-based method for COVID-19 lung infection segmentation ( http://arxiv.org/abs/2007.00861v2 )

ライセンス: Link先を確認
The novelty of the COVID-19 disease and the speed of its spread have created colossal chaos and have driven researchers worldwide to exploit all available resources and capabilities to understand and analyze the characteristics of the coronavirus, in terms of how it spreads and its incubation time. For that, existing medical imaging data such as CT and X-ray images are used. For example, CT-scan images can be used for the detection of lung infection. However, challenges such as image quality and infection characteristics limit the effectiveness of these images. Using artificial intelligence (AI) tools and computer vision algorithms, detection can be made more accurate, which can help to overcome these issues. This paper proposes a multi-task deep-learning-based method for lung infection segmentation using CT-scan images. Our proposed method starts by segmenting the lung regions that can be infected, and then segments the infections in these regions. In addition, to perform multi-class segmentation, the proposed model is trained using two-stream inputs. The multi-task learning used in this paper allows us to overcome the shortage of labeled data. Also, the multi-input stream allows the model to learn from many features, which can improve the results. To evaluate the proposed method, many features have been used. The experiments show that the proposed method can segment lung infections with high performance even with a shortage of data and labeled images. In addition, our method achieves good results in comparison with state-of-the-art methods.
翻訳日:2022-11-14 14:36:01 公開日:2020-07-04
# MEGデータの深部脳状態分類

Deep brain state classification of MEG data ( http://arxiv.org/abs/2007.00897v2 )

ライセンス: Link先を確認
Neuroimaging techniques have been shown to be useful when studying the brain's activity. This paper uses Magnetoencephalography (MEG) data, provided by the Human Connectome Project (HCP), in combination with various deep artificial neural network models to perform brain decoding. More specifically, we investigate to what extent we can infer the task performed by a subject based on their MEG data. Three models are proposed and examined: one based on compact convolution, one based on a combined convolutional and long short-term architecture, and one based on multi-view learning that aims at fusing the outputs of the two stream networks. These models exploit the spatio-temporal MEG data for learning new representations that are used to decode the relevant tasks across subjects. In order to identify the most relevant features of the input signals, two attention mechanisms, i.e. self and global attention, are incorporated in all the models. The experimental results of cross-subject multi-class classification on the studied MEG dataset show that the inclusion of attention improves the generalization of the models across subjects.
翻訳日:2022-11-14 13:42:42 公開日:2020-07-04
# 畳み込みネットワークを用いた心電図QRS検出のためのサンプリング周波数の選択

Choosing a sampling frequency for ECG QRS detection using convolutional networks ( http://arxiv.org/abs/2007.02052v1 )

ライセンス: Link先を確認
Automated QRS detection methods depend on ECG data sampled at a certain frequency, whether they are traditional filter-based methods or convolutional network (CNN) based deep learning methods. These methods require the sampling frequency at which they operate to be selected at the outset. When working with two datasets sampled at different frequencies, data from both often need to be resampled to a common target frequency, which may be the frequency of either dataset or a different one. However, the choice of sampling frequency may affect the model's generalisation capacity and complexity. Some studies have investigated the effects of ECG sampling frequency on traditional filter-based methods; however, an extensive study of its effect on deep learning-based models (convolutional networks), covering their generalisability and complexity, has yet to be carried out. This experimental research investigates the impact of six different sampling frequencies (50, 100, 250, 500, 1000, and 2000Hz) on the generalisability and complexity of four different convolutional network-based models, in order to form a basis for deciding on an appropriate sampling frequency for the QRS detection task given a particular performance requirement. Intra-database tests report an accuracy improvement of no more than approximately 0.6% from 100Hz to 250Hz, and the shorter interquartile range for those two frequencies, for all CNN-based models. The findings reveal that convolutional network-based deep learning models are capable of scoring higher levels of detection accuracy on ECG signals sampled at frequencies as low as 100Hz or 250Hz while maintaining lower model complexity (number of trainable parameters and training time).
翻訳日:2022-11-13 13:57:12 公開日:2020-07-04
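The resampling step described in the abstract above (bringing two datasets to a common target frequency) can be sketched as follows. The abstract does not say which resampling method the authors used; plain linear interpolation is assumed here purely for illustration, and the function name `resample_linear` is hypothetical.

```python
import math
from typing import List, Sequence


def resample_linear(signal: Sequence[float], fs_in: float, fs_out: float) -> List[float]:
    """Resample a uniformly sampled signal from fs_in to fs_out (Hz).

    Uses linear interpolation between neighbouring input samples.
    Note: when downsampling, a real pipeline would normally low-pass
    filter first to avoid aliasing; that step is omitted in this sketch.
    """
    if fs_in <= 0 or fs_out <= 0:
        raise ValueError("sampling frequencies must be positive")
    if len(signal) < 2:
        return list(signal)
    last = len(signal) - 1
    duration = last / fs_in  # time of the last input sample, in seconds
    # Number of output samples that fit inside the input's time span.
    n_out = int(math.floor(duration * fs_out + 1e-9)) + 1
    out = []
    for k in range(n_out):
        pos = min(k * fs_in / fs_out, float(last))  # fractional input index
        i = min(int(pos), last - 1)
        frac = pos - i
        out.append(signal[i] * (1.0 - frac) + signal[i + 1] * frac)
    return out
```

For instance, a 250Hz recording can be brought to 500Hz with `resample_linear(x, 250, 500)`, after which it has the same time step as a native 500Hz dataset.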
# CardioLearn:心電図による心疾患検出のためのクラウドディープラーニングサービス

CardioLearn: A Cloud Deep Learning Service for Cardiac Disease Detection from Electrocardiogram ( http://arxiv.org/abs/2007.02165v1 )

ライセンス: Link先を確認
The electrocardiogram (ECG) is one of the most convenient and non-invasive tools for monitoring people's heart condition, and it can be used for diagnosing a wide range of heart diseases, including cardiac arrhythmia, acute coronary syndrome, etc. However, traditional ECG disease detection models show substantial rates of misdiagnosis due to the limited capability of the extracted features. Recent deep learning methods have shown significant advantages, but they do not provide publicly available services for those who have no training data or computational resources. In this paper, we demonstrate our work on building, training, and serving such an out-of-the-box cloud deep learning service for cardiac disease detection from ECG, named CardioLearn. The analytic ability of any other ECG recording device can be enhanced by connecting it to the Internet and invoking our open API. As a practical example, we also design a portable smart hardware device along with an interactive mobile program, which can collect ECG and detect potential cardiac diseases anytime and anywhere.
翻訳日:2022-11-13 13:56:42 公開日:2020-07-04
# 逆変形場を用いた医用画像の病的証拠の解釈

Interpretation of Disease Evidence for Medical Images Using Adversarial Deformation Fields ( http://arxiv.org/abs/2007.01975v1 )

ライセンス: Link先を確認
The high complexity of deep learning models is associated with the difficulty of explaining what evidence they recognize as correlating with specific disease labels. This information is critical for building trust in models and finding their biases. Until now, automated deep learning visualization solutions have identified regions of images used by classifiers, but these solutions are too coarse, too noisy, or have a limited representation of the way images can change. We propose a novel method for formulating and presenting spatial explanations of disease evidence, called deformation field interpretation with generative adversarial networks (DeFI-GAN). An adversarially trained generator produces deformation fields that modify images of diseased patients to resemble images of healthy patients. We validate the method studying chronic obstructive pulmonary disease (COPD) evidence in chest x-rays (CXRs) and Alzheimer's disease (AD) evidence in brain MRIs. When extracting disease evidence in longitudinal data, we show compelling results against a baseline producing difference maps. DeFI-GAN also highlights disease biomarkers not found by previous methods and potential biases that may help in investigations of the dataset and of the adopted learning methods.
翻訳日:2022-11-13 13:55:14 公開日:2020-07-04
# 低光度画像強調のための深部両側網膜

Deep Bilateral Retinex for Low-Light Image Enhancement ( http://arxiv.org/abs/2007.02018v1 )

ライセンス: Link先を確認
Low-light images, i.e. the images captured in low-light conditions, suffer from very poor visibility caused by low contrast, color distortion and significant measurement noise. Low-light image enhancement is about improving the visibility of low-light images. As the measurement noise in low-light images is usually significant yet complex with spatially-varying characteristic, how to handle the noise effectively is an important yet challenging problem in low-light image enhancement. Based on the Retinex decomposition of natural images, this paper proposes a deep learning method for low-light image enhancement with a particular focus on handling the measurement noise. The basic idea is to train a neural network to generate a set of pixel-wise operators for simultaneously predicting the noise and the illumination layer, where the operators are defined in the bilateral space. Such an integrated approach allows us to have an accurate prediction of the reflectance layer in the presence of significant spatially-varying measurement noise. Extensive experiments on several benchmark datasets have shown that the proposed method is very competitive to the state-of-the-art methods, and has significant advantage over others when processing images captured in extremely low lighting conditions.
翻訳日:2022-11-13 13:46:51 公開日:2020-07-04
# 同時分類と追跡による効率的かつ正確な物体検出

Efficient and accurate object detection with simultaneous classification and tracking ( http://arxiv.org/abs/2007.02065v1 )

ライセンス: Link先を確認
Interacting with the environment, such as object detection and tracking, is a crucial ability of mobile robots. Besides high accuracy, efficiency in terms of processing effort and energy consumption are also desirable. To satisfy both requirements, we propose a detection framework based on simultaneous classification and tracking in the point stream. In this framework, a tracker performs data association in sequences of the point cloud, guiding the detector to avoid redundant processing (i.e. classifying already-known objects). For objects whose classification is not sufficiently certain, a fusion model is designed to fuse selected key observations that provide different perspectives across the tracking span. Therefore, performance (accuracy and efficiency of detection) can be enhanced. This method is particularly suitable for detecting and tracking moving objects, a process that would require expensive computations if solved using conventional procedures. Experiments were conducted on the benchmark dataset, and the results showed that the proposed method outperforms original tracking-by-detection approaches in both efficiency and accuracy.
翻訳日:2022-11-13 13:46:14 公開日:2020-07-04
# Speckle2Void: Blind-Spot畳み込みニューラルネットワークを用いた深部自己スーパービジョンSARデスペックリング

Speckle2Void: Deep Self-Supervised SAR Despeckling with Blind-Spot Convolutional Neural Networks ( http://arxiv.org/abs/2007.02075v1 )

ライセンス: Link先を確認
Information extraction from synthetic aperture radar (SAR) images is heavily impaired by speckle noise, hence despeckling is a crucial preliminary step in scene analysis algorithms. The recent success of deep learning envisions a new generation of despeckling techniques that could outperform classical model-based methods. However, current deep learning approaches to despeckling require supervision for training, whereas clean SAR images are impossible to obtain. In the literature, this issue is tackled by resorting to either synthetically speckled optical images, which exhibit different properties with respect to true SAR images, or multi-temporal SAR images, which are difficult to acquire or fuse accurately. In this paper, inspired by recent works on blind-spot denoising networks, we propose a self-supervised Bayesian despeckling method. The proposed method is trained employing only noisy SAR images and can therefore learn features of real SAR images rather than synthetic data. Experiments show that the performance of the proposed approach is very close to the supervised training approach on synthetic data and superior on real data in both quantitative and visual assessments.
翻訳日:2022-11-13 13:45:58 公開日:2020-07-04
# 細粒度特徴マップの構造情報を用いた病理組織画像のレジストレーション

Registration of Histopathology Images Using Structural Information From Fine Grained Feature Maps ( http://arxiv.org/abs/2007.02078v1 )

ライセンス: Link先を確認
Registration is an important part of many clinical workflows, and including information about structures of interest improves registration performance. We propose a novel approach to incorporating segmentation information into a registration framework, using self-supervised segmentation feature maps extracted with a pre-trained segmentation network followed by clustering. Using self-supervised feature maps enables us to use segmentation information despite the unavailability of manual segmentations. Experimental results show that our approach effectively replaces manual segmentation maps and demonstrate the possibility of obtaining state-of-the-art registration performance in real-world cases where manual segmentation maps are unavailable.
翻訳日:2022-11-13 13:45:40 公開日:2020-07-04
# マトロイド拘束下サブモジュラー最大化によるマルチセンサの次回のベストビュー計画

Multi-Sensor Next-Best-View Planning as Matroid-Constrained Submodular Maximization ( http://arxiv.org/abs/2007.02084v1 )

ライセンス: Link先を確認
3D scene models are useful in robotics for tasks such as path planning, object manipulation, and structural inspection. We consider the problem of creating a 3D model using depth images captured by a team of multiple robots. Each robot selects a viewpoint and captures a depth image from it, and the images are fused to update the scene model. The process is repeated until a scene model of desired quality is obtained. Next-best-view planning uses the current scene model to select the next viewpoints. The objective is to select viewpoints so that the images captured using them improve the quality of the scene model the most. In this paper, we address next-best-view planning for multiple depth cameras. We propose a utility function that scores sets of viewpoints and avoids overlap between multiple sensors. We show that multi-sensor next-best-view planning with this utility function is an instance of submodular maximization under a matroid constraint. This allows the planning problem to be solved by a polynomial-time greedy algorithm that yields a solution within a constant factor from the optimal. We evaluate the performance of our planning algorithm in simulated experiments with up to 8 sensors, and in real-world experiments using two robot arms equipped with depth cameras.
翻訳日:2022-11-13 13:45:26 公開日:2020-07-04
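The greedy selection described in the abstract above can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' planner: the utility here is a simple count of newly covered scene cells (a standard monotone submodular function standing in for the paper's utility), the "one viewpoint per sensor" rule is modelled as a partition matroid, and the names `greedy_next_best_views` and `candidates` are hypothetical.

```python
from typing import Dict, FrozenSet, Hashable, Optional, Set, Tuple

Viewpoint = Hashable


def greedy_next_best_views(
    candidates: Dict[str, Dict[Viewpoint, FrozenSet[int]]],
) -> Tuple[Dict[str, Viewpoint], Set[int]]:
    """Pick at most one viewpoint per sensor, greedily maximizing coverage.

    candidates maps sensor name -> {viewpoint: set of scene cells it would observe}.
    Each round picks the (sensor, viewpoint) pair with the largest marginal
    gain among sensors that have not been assigned yet, so cells already
    covered by another sensor contribute nothing (overlap is not rewarded).
    """
    chosen: Dict[str, Viewpoint] = {}
    covered: Set[int] = set()
    while len(chosen) < len(candidates):
        best: Optional[Tuple[int, str, Viewpoint]] = None
        for sensor, views in candidates.items():
            if sensor in chosen:
                continue  # matroid constraint: one viewpoint per sensor
            for view, cells in views.items():
                gain = len(cells - covered)  # marginal gain of this view
                if best is None or gain > best[0]:
                    best = (gain, sensor, view)
        if best is None:
            break  # remaining sensors have no candidate viewpoints
        _, sensor, view = best
        chosen[sensor] = view
        covered |= candidates[sensor][view]
    return chosen, covered
```

For a monotone submodular utility under a matroid constraint, this kind of greedy rule is known to reach at least half of the optimal value, which is one example of the constant-factor guarantee the abstract refers to.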
# ロバストなrgb-tトラッキングのための運動と外観の同時モデリング

Jointly Modeling Motion and Appearance Cues for Robust RGB-T Tracking ( http://arxiv.org/abs/2007.02041v1 )

ライセンス: Link先を確認
In this study, we propose a novel RGB-T tracking framework by jointly modeling both appearance and motion cues. First, to obtain a robust appearance model, we develop a novel late fusion method to infer the fusion weight maps of both RGB and thermal (T) modalities. The fusion weights are determined by using offline-trained global and local multimodal fusion networks, and then adopted to linearly combine the response maps of RGB and T modalities. Second, when the appearance cue is unreliable, we comprehensively take motion cues, i.e., target and camera motions, into account to make the tracker robust. We further propose a tracker switcher to switch the appearance and motion trackers flexibly. Numerous results on three recent RGB-T tracking datasets show that the proposed tracker performs significantly better than other state-of-the-art algorithms.
翻訳日:2022-11-13 13:39:08 公開日:2020-07-04
# 自己校正支援ロバスト射影構造

Self-Calibration Supported Robust Projective Structure-from-Motion ( http://arxiv.org/abs/2007.02045v1 )

ライセンス: Link先を確認
Typical Structure-from-Motion (SfM) pipelines rely on finding correspondences across images, recovering the projective structure of the observed scene and upgrading it to a metric frame using camera self-calibration constraints. Solving each problem is mainly carried out independently from the others. For instance, camera self-calibration generally assumes correct matches and a good projective reconstruction have been obtained. In this paper, we propose a unified SfM method, in which the matching process is supported by self-calibration constraints. We use the idea that good matches should yield a valid calibration. In this process, we make use of the Dual Image of Absolute Quadric projection equations within a multiview correspondence framework, in order to obtain robust matching from a set of putative correspondences. The matching process classifies points as inliers or outliers, which is learned in an unsupervised manner using a deep neural network. Together with theoretical reasoning why the self-calibration constraints are necessary, we show experimental results demonstrating robust multiview matching and accurate camera calibration by exploiting these constraints.
翻訳日:2022-11-13 13:38:52 公開日:2020-07-04
# クロススセナリオ3次元ポーズ推定のための推定段階最適化

Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation ( http://arxiv.org/abs/2007.02054v1 )

ライセンス: Link先を確認
Existing 3D human pose estimation models suffer a performance drop when applied to new scenarios with unseen poses, due to their limited generalizability. In this work, we propose a novel framework, Inference Stage Optimization (ISO), for improving the generalizability of 3D pose models when source and target data come from different pose distributions. Our main insight is that the target data, even though not labeled, carry valuable priors about their underlying distribution. To exploit such information, the proposed ISO performs geometry-aware self-supervised learning (SSL) on each single target instance and updates the 3D pose model before making a prediction. In this way, the model can mine distributional knowledge about the target scenario and quickly adapt to it with enhanced generalization performance. In addition, to handle sequential target data, we propose an online mode for implementing our ISO framework via streaming the SSL, which substantially enhances its effectiveness. We systematically analyze why and how our ISO framework works on diverse benchmarks under a cross-scenario setup. Remarkably, it yields a new state-of-the-art of 83.6% 3D PCK on MPI-INF-3DHP, improving upon the previous best result by 9.7%. Code will be released.
翻訳日:2022-11-13 13:38:32 公開日:2020-07-04
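The 3D PCK figure quoted in the abstract above is a percentage of correctly localized joints. A minimal sketch of such a metric follows; the function name `pck_3d` is hypothetical, and the 150 mm default threshold is an assumption based on common practice for MPI-INF-3DHP rather than something stated in the abstract.

```python
import math
from typing import Sequence, Tuple

Point3D = Tuple[float, float, float]


def pck_3d(pred: Sequence[Point3D], gt: Sequence[Point3D], threshold_mm: float = 150.0) -> float:
    """Percentage of joints whose Euclidean error is within threshold_mm.

    pred and gt hold one (x, y, z) position per joint, in millimetres,
    in the same joint order.
    """
    if len(pred) != len(gt) or not gt:
        raise ValueError("pred and gt must be non-empty and of equal length")
    correct = sum(1 for p, g in zip(pred, gt) if math.dist(p, g) <= threshold_mm)
    return 100.0 * correct / len(gt)
```

Benchmark numbers are normally averaged over all frames and joints of the test set; the function above scores a single pose and would be averaged by the caller.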
# 細粒度認識のためのフィッシャーベクトル符号化のエンドツーエンド学習

End-to-end Learning of a Fisher Vector Encoding for Part Features in Fine-grained Recognition ( http://arxiv.org/abs/2007.02080v1 )

ライセンス: Link先を確認
Part-based approaches for fine-grained recognition do not show the expected performance gain over global methods, despite being able to explicitly focus on small details that are relevant for distinguishing highly similar classes. We assume that part-based methods suffer from a missing representation of local features that is invariant to the order of parts and can handle a varying number of visible parts appropriately. The order of parts is artificial and often only given by ground-truth annotations, whereas viewpoint variations and occlusions result in parts that are not observable. Therefore, we propose integrating a Fisher vector encoding of part features into convolutional neural networks. The parameters for this encoding are estimated jointly with those of the neural network in an end-to-end manner. Our approach improves state-of-the-art accuracies for bird species classification on CUB-200-2011 from 90.40% to 90.95%, on NA-Birds from 89.20% to 90.30%, and on Birdsnap from 84.30% to 86.97%.
Translated: 2022-11-13 13:37:49 Published: 2020-07-04
# Local Grid Rendering Networks for 3D Object Detection in Point Clouds

Local Grid Rendering Networks for 3D Object Detection in Point Clouds ( http://arxiv.org/abs/2007.02099v1 )

License: see the linked page
The performance of 3D object detection models over point clouds highly depends on their capability of modeling local geometric patterns. Conventional point-based models exploit local patterns through a symmetric function (e.g., max pooling) or based on graphs, which easily leads to loss of fine-grained geometric structures. For capturing spatial patterns, CNNs are powerful, but it would be computationally costly to directly apply convolutions on point data after voxelizing the entire point cloud to a dense regular 3D grid. In this work, we aim to improve the performance of point-based models by enhancing their pattern learning ability through leveraging CNNs while preserving computational efficiency. We propose a novel and principled Local Grid Rendering (LGR) operation to render the small neighborhood of a subset of input points into a low-resolution 3D grid independently, which allows small-size CNNs to accurately model local patterns and avoids convolutions over a dense grid to save computation cost. With the LGR operation, we introduce a new generic backbone called LGR-Net for point cloud feature extraction with simple design and high efficiency. We validate LGR-Net for 3D object detection on the challenging ScanNet and SUN RGB-D datasets. It advances state-of-the-art results significantly by 5.5 and 4.5 mAP, respectively, with only slightly increased computation overhead.
Translated: 2022-11-13 13:37:25 Published: 2020-07-04
# SplitFusion: Simultaneous Tracking and Mapping for Non-Rigid Scenes

SplitFusion: Simultaneous Tracking and Mapping for Non-Rigid Scenes ( http://arxiv.org/abs/2007.02108v1 )

License: see the linked page
We present SplitFusion, a novel dense RGB-D SLAM framework that simultaneously performs tracking and dense reconstruction for both rigid and non-rigid components of the scene. SplitFusion first adopts a deep learning based semantic instance segmentation technique to split the scene into rigid and non-rigid surfaces. The split surfaces are independently tracked via rigid or non-rigid ICP and reconstructed through incremental depth map fusion. Experimental results show that the proposed approach can provide not only accurate environment maps but also well-reconstructed non-rigid targets, e.g., moving humans.
Translated: 2022-11-13 13:37:04 Published: 2020-07-04
# Face Anti-Spoofing with Human Material Perception

Face Anti-Spoofing with Human Material Perception ( http://arxiv.org/abs/2007.02157v1 )

License: see the linked page
Face anti-spoofing (FAS) plays a vital role in securing face recognition systems from presentation attacks. Most existing FAS methods capture various cues (e.g., texture, depth and reflection) to distinguish live faces from spoofing faces. All these cues are based on the discrepancy among physical materials (e.g., skin, glass, paper and silicone). In this paper we rephrase face anti-spoofing as a material recognition problem and combine it with classical human material perception [1], intending to extract discriminative and robust features for FAS. To this end, we propose the Bilateral Convolutional Networks (BCN), which are able to capture intrinsic material-based patterns by aggregating multi-level bilateral macro- and micro-information. Furthermore, a Multi-level Feature Refinement Module (MFRM) and multi-head supervision are utilized to learn more robust features. Comprehensive experiments are performed on six benchmark datasets, and the proposed method achieves superior performance on both intra- and cross-dataset testing. One highlight is that we achieve an overall 11.3$\pm$9.5% EER for cross-type testing on the SiW-M dataset, which significantly outperforms previous results. We hope this work will facilitate future cooperation between the FAS and material communities.
Translated: 2022-11-13 13:36:43 Published: 2020-07-04
# Wind speed prediction using multidimensional convolutional neural networks

Wind speed prediction using multidimensional convolutional neural networks ( http://arxiv.org/abs/2007.12567v1 )

License: see the linked page
Accurate wind speed forecasting is of great importance for many economic, business and management sectors. This paper introduces a new model based on convolutional neural networks (CNNs) for wind speed prediction tasks. In particular, we show that compared to classical CNN-based models, the proposed model is able to better characterise the spatio-temporal evolution of the wind data by learning the underlying complex input-output relationships from multiple dimensions (views) of the input data. The proposed model exploits the spatio-temporal multivariate multidimensional historical weather data for learning new representations used for wind forecasting. We conduct experiments on two real-life weather datasets. The datasets are measurements from cities in Denmark and in the Netherlands. The proposed model is compared with traditional 2- and 3-dimensional CNN models, a 2D-CNN model with an attention layer and a 2D-CNN model equipped with upscaling and depthwise separable convolutions.
Translated: 2022-11-13 13:36:19 Published: 2020-07-04
# Low Rank Fusion based Transformers for Multimodal Sequences

Low Rank Fusion based Transformers for Multimodal Sequences ( http://arxiv.org/abs/2007.02038v1 )

License: see the linked page
Our senses individually work in a coordinated fashion to express our emotional intentions. In this work, we experiment with modeling modality-specific sensory signals to attend to our latent multimodal emotional intentions, and vice versa, expressed via low-rank multimodal fusion and multimodal transformers. The low-rank factorization of multimodal fusion amongst the modalities helps represent approximate multiplicative latent signal interactions. Motivated by the work of \cite{tsai2019MULT} and \cite{Liu_2018}, we present our transformer-based cross-fusion architecture without any over-parameterization of the model. The low-rank fusion helps represent the latent signal interactions while the modality-specific attention helps focus on relevant parts of the signal. We present results for two methods on multimodal sentiment and emotion recognition using the CMU-MOSEI, CMU-MOSI, and IEMOCAP datasets and show that our models have fewer parameters, train faster, and perform comparably to many larger fusion-based architectures.
Translated: 2022-11-13 13:29:29 Published: 2020-07-04
# Birds of a Feather Flock Together: Satirical News Detection via Language Model Differentiation

Birds of a Feather Flock Together: Satirical News Detection via Language Model Differentiation ( http://arxiv.org/abs/2007.02164v1 )

License: see the linked page
Satirical news is regularly shared in modern social media because it is entertaining with smartly embedded humor. However, it can be harmful to society because it can sometimes be mistaken for factual news, due to its deceptive character. We found that in satirical news, the lexical and pragmatic attributes of the context are the key factors in amusing the readers. In this work, we propose a method that differentiates satirical news from true news. It takes advantage of satirical writing evidence by leveraging the difference between the prediction losses of two language models, one trained on true news and the other on satirical news, when given a new news article. We compute several statistical metrics of language model prediction loss as features, which are then used to conduct downstream classification. The proposed method is computationally effective because the language models capture the language usage differences between satirical news documents and traditional news documents, and are sensitive when applied to documents outside their domains.
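The loss-difference idea in this abstract can be illustrated with a minimal Python sketch. Everything here is an assumption for illustration: the abstract does not say which language models or which statistics are used, so add-one-smoothed unigram models stand in for the two language models, and the helper names `train_unigram` and `loss_features` are hypothetical.

```python
import math
from collections import Counter


def train_unigram(docs):
    """Return a function giving the per-token loss (negative log-probability)
    under an add-one-smoothed unigram model. A stand-in for a real language model."""
    counts = Counter(tok for doc in docs for tok in doc.lower().split())
    total = sum(counts.values())
    vocab = len(counts) + 1  # one extra slot for unseen tokens
    return lambda tok: -math.log((counts[tok] + 1) / (total + vocab))


def loss_features(doc, true_lm, satire_lm):
    """Summary statistics of the per-token loss difference between the two models.
    Positive values mean the true-news model is more surprised than the satire model."""
    diffs = [true_lm(t) - satire_lm(t) for t in doc.lower().split()]
    mean = sum(diffs) / len(diffs)
    var = sum((d - mean) ** 2 for d in diffs) / len(diffs)
    return {"mean_diff": mean, "std_diff": math.sqrt(var), "max_diff": max(diffs)}


true_lm = train_unigram(["the senate passed the budget bill"])
satire_lm = train_unigram(["area man yells at cloud again"])
features = loss_features("area man yells at senate", true_lm, satire_lm)
```

The resulting feature dictionary would then be fed to any downstream classifier; the choice of mean, standard deviation, and maximum is only one plausible set of statistics.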
Translated: 2022-11-13 13:29:11 Published: 2020-07-04
# A Case for Lifetime Reliability-Aware Neuromorphic Computing

A Case for Lifetime Reliability-Aware Neuromorphic Computing ( http://arxiv.org/abs/2007.02210v1 )

License: see the linked page
Neuromorphic computing with non-volatile memory (NVM) can significantly improve performance and lower energy consumption of machine learning tasks implemented using spike-based computations and bio-inspired learning algorithms. High voltages required to operate certain NVMs such as phase-change memory (PCM) can accelerate aging in a neuron's CMOS circuit, thereby reducing the lifetime of neuromorphic hardware. In this work, we evaluate the long-term (i.e., lifetime) reliability impact of executing state-of-the-art machine learning tasks on neuromorphic hardware, considering failure models such as negative bias temperature instability (NBTI) and time-dependent dielectric breakdown (TDDB). Based on such formulation, we show the reliability-performance trade-off obtained due to periodic relaxation of neuromorphic circuits, i.e., a stop-and-go style of neuromorphic computing.
Translated: 2022-11-13 13:28:54 Published: 2020-07-04
# DRDr: Automatic Masking of Exudates and Microaneurysms Caused By Diabetic Retinopathy Using Mask R-CNN and Transfer Learning

DRDr: Automatic Masking of Exudates and Microaneurysms Caused By Diabetic Retinopathy Using Mask R-CNN and Transfer Learning ( http://arxiv.org/abs/2007.02026v1 )

License: see the linked page
This paper addresses the problem of identifying two main types of lesions, Exudates and Microaneurysms, caused by Diabetic Retinopathy (DR) in the eyes of diabetic patients. We make use of Convolutional Neural Networks (CNNs) and Transfer Learning to locate and generate a high-quality segmentation mask for each instance of a lesion that can be found in the patients' fundus images. We create our normalized database out of e-ophtha EX and e-ophtha MA and tweak Mask R-CNN to detect small lesions. Moreover, we employ data augmentation and the pre-trained weights of ResNet101 to compensate for our small dataset. Our model achieves a promising test mAP of 0.45, altogether showing that it can aid clinicians and ophthalmologists in the process of detecting and treating the infamous DR.
Translated: 2022-11-13 13:28:16 Published: 2020-07-04
# Shape-aware Meta-learning for Generalizing Prostate MRI Segmentation to Unseen Domains

Shape-aware Meta-learning for Generalizing Prostate MRI Segmentation to Unseen Domains ( http://arxiv.org/abs/2007.02035v1 )

License: see the linked page
Model generalization capacity under domain shift (e.g., various imaging protocols and scanners) is crucial for deep learning methods in real-world clinical deployment. This paper tackles the challenging problem of domain generalization, i.e., learning a model from multi-domain source data such that it can directly generalize to an unseen target domain. We present a novel shape-aware meta-learning scheme to improve the model generalization in prostate MRI segmentation. Our learning scheme is rooted in gradient-based meta-learning, explicitly simulating domain shift with virtual meta-train and meta-test sets during training. Importantly, considering the deficiencies encountered when applying a segmentation model to unseen domains (i.e., incomplete shape and ambiguous boundary of the prediction masks), we further introduce two complementary loss objectives to enhance the meta-optimization, by particularly encouraging the shape compactness and shape smoothness of the segmentations under simulated domain shift. We evaluate our method on prostate MRI data from six different institutions with distribution shifts acquired from public datasets. Experimental results show that our approach outperforms many state-of-the-art generalization methods consistently across all six settings of unseen domains.
Translated: 2022-11-13 13:27:59 Published: 2020-07-04
# Modality Shifting Attention Network for Multi-modal Video Question Answering

Modality Shifting Attention Network for Multi-modal Video Question Answering ( http://arxiv.org/abs/2007.02036v1 )

License: see the linked page
This paper considers a network referred to as Modality Shifting Attention Network (MSAN) for the Multimodal Video Question Answering (MVQA) task. MSAN decomposes the task into two sub-tasks: (1) localization of the temporal moment relevant to the question, and (2) accurate prediction of the answer based on the localized moment. The modality required for temporal localization may be different from that for answer prediction, and this ability to shift modality is essential for performing the task. To this end, MSAN is based on (1) the moment proposal network (MPN) that attempts to locate the most appropriate temporal moment from each of the modalities, and also on (2) the heterogeneous reasoning network (HRN) that predicts the answer using an attention mechanism on both modalities. MSAN is able to place importance weight on the two modalities for each sub-task using a component referred to as Modality Importance Modulation (MIM). Experimental results show that MSAN outperforms the previous state of the art by achieving 71.13% test accuracy on the TVQA benchmark dataset. Extensive ablation studies and qualitative analysis are conducted to validate various components of the network.
Translated: 2022-11-13 13:27:37 Published: 2020-07-04
# Lazy Greedy Hypervolume Subset Selection from Large Candidate Solution Sets

Lazy Greedy Hypervolume Subset Selection from Large Candidate Solution Sets ( http://arxiv.org/abs/2007.02050v1 )

License: see the linked page
Subset selection has been a popular topic in recent years, and a number of subset selection methods have been proposed. Among those methods, hypervolume subset selection is widely used. Greedy hypervolume subset selection algorithms can achieve good approximations to the optimal subset. However, when the candidate set is large (e.g., an unbounded external archive with a large number of solutions), the algorithm is very time-consuming. In this paper, we propose a new lazy greedy algorithm exploiting the submodular property of the hypervolume indicator. The core idea is to avoid unnecessary hypervolume contribution calculations when finding the solution with the largest contribution. Experimental results show that the proposed algorithm is hundreds of times faster than the original greedy inclusion algorithm and several times faster than the fastest known greedy inclusion algorithm on many test problems.
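The lazy evaluation idea can be sketched in Python as follows. This is a generic lazy greedy routine under stated assumptions, not the paper's algorithm: it is restricted to two objectives, assumes minimization with a user-supplied reference point, and the function names `hypervolume_2d` and `lazy_greedy_hss` are hypothetical. Submodularity means a contribution computed against an earlier, smaller selection is an upper bound on the current one, so a stale heap entry only needs recomputing when it reaches the top.

```python
import heapq


def hypervolume_2d(points, ref):
    """Area dominated by `points` and bounded by `ref` (two objectives, minimization)."""
    pts = sorted(p for p in points if p[0] < ref[0] and p[1] < ref[1])
    area, best_y = 0.0, ref[1]
    for x, y in pts:  # sweep in increasing x; only points that lower y add area
        if y < best_y:
            area += (ref[0] - x) * (best_y - y)
            best_y = y
    return area


def lazy_greedy_hss(candidates, k, ref):
    """Greedy inclusion of k points, recomputing a contribution only when needed."""
    selected, hv = [], 0.0
    # Max-heap (via negation) of contributions; exact for the empty selection.
    heap = [(-hypervolume_2d([c], ref), i) for i, c in enumerate(candidates)]
    heapq.heapify(heap)
    while heap and len(selected) < k:
        _, i = heapq.heappop(heap)
        gain = hypervolume_2d(selected + [candidates[i]], ref) - hv
        # Remaining heap values are upper bounds, so beating the top one suffices.
        if not heap or gain >= -heap[0][0]:
            selected.append(candidates[i])
            hv += gain
        else:
            heapq.heappush(heap, (-gain, i))
    return selected, hv


candidates = [(1, 5), (2, 3), (4, 2), (5, 1), (3, 4)]
subset, hv = lazy_greedy_hss(candidates, k=2, ref=(6, 6))
```

On large candidate sets most entries are never re-evaluated after the first round, which is where the savings over plain greedy inclusion would come from.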
Translated: 2022-11-13 13:20:56 Published: 2020-07-04
# Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion

Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion ( http://arxiv.org/abs/2007.01990v1 )

License: see the linked page
Langevin diffusion is a powerful method for nonconvex optimization, which enables the escape from local minima by injecting noise into the gradient. In particular, the temperature parameter controlling the noise level gives rise to a tradeoff between "global exploration" and "local exploitation", which correspond to high and low temperatures, respectively. To attain the advantages of both regimes, we propose to use replica exchange, which swaps between two Langevin diffusions with different temperatures. We theoretically analyze the acceleration effect of replica exchange from two perspectives: (i) the convergence in $\chi^2$-divergence, and (ii) the large deviation principle. Such an acceleration effect allows us to approach the global minima faster. Furthermore, by discretizing the replica exchange Langevin diffusion, we obtain a discrete-time algorithm. For such an algorithm, we quantify its discretization error in theory and demonstrate its acceleration effect in practice.
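A discretized version of the two-temperature scheme might look like the Python sketch below. It is an illustration under assumptions rather than the paper's algorithm: it uses a plain Euler-Maruyama step, a one-dimensional state, and the standard parallel-tempering swap probability; the function name and all default parameter values are hypothetical.

```python
import numpy as np


def replica_exchange_langevin(U, grad_U, x_low, x_high, tau_low=0.05, tau_high=1.0,
                              step=1e-2, swap_rate=1.0, n_steps=5000, seed=0):
    """Run a low- and a high-temperature Langevin chain with occasional swaps.
    Returns the best point seen by the low-temperature chain and its objective value."""
    rng = np.random.default_rng(seed)
    best_x, best_u = x_low, U(x_low)
    for _ in range(n_steps):
        # Euler-Maruyama step: gradient descent plus temperature-scaled Gaussian noise.
        x_low = x_low - step * grad_U(x_low) + np.sqrt(2 * step * tau_low) * rng.standard_normal()
        x_high = x_high - step * grad_U(x_high) + np.sqrt(2 * step * tau_high) * rng.standard_normal()
        # Swap is favoured when the hot chain has found a lower objective value.
        delta = (1 / tau_low - 1 / tau_high) * (U(x_low) - U(x_high))
        p_swap = min(1.0, swap_rate * step) * np.exp(min(0.0, delta))
        if rng.random() < p_swap:
            x_low, x_high = x_high, x_low
        u = U(x_low)
        if u < best_u:
            best_x, best_u = x_low, u
    return best_x, best_u


# Tilted double well: local minimum near x = 1, global minimum near x = -1.
U = lambda x: (x ** 2 - 1) ** 2 + 0.3 * x
grad_U = lambda x: 4 * x * (x ** 2 - 1) + 0.3
x_best, u_best = replica_exchange_langevin(U, grad_U, x_low=1.0, x_high=1.0)
```

The high-temperature chain explores across the barrier, and a swap hands its position to the low-temperature chain, which then refines it locally.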
Translated: 2022-11-13 13:20:20 Published: 2020-07-04
# RDP-GAN: A Rényi-Differential Privacy based Generative Adversarial Network

RDP-GAN: A Rényi-Differential Privacy based Generative Adversarial Network ( http://arxiv.org/abs/2007.02056v1 )

License: see the linked page
The generative adversarial network (GAN) has attracted increasing attention recently owing to its impressive ability to generate realistic samples with high privacy protection. Without directly interacting with training examples, the generative model can be fully used to estimate the underlying distribution of an original dataset, while the discriminative model can examine the quality of the generated samples by comparing the label values with the training examples. However, when GANs are applied to sensitive or private training examples, such as medical or financial records, it is still probable that individuals' sensitive and private information will be divulged. To mitigate this information leakage and construct a private GAN, in this work we propose a Rényi-differentially private GAN (RDP-GAN), which achieves differential privacy (DP) in a GAN by carefully adding random noise to the value of the loss function during training. Moreover, we derive the analytical results of the total privacy loss under the subsampling method and cumulated iterations, which show its effectiveness on the privacy budget allocation. In addition, in order to mitigate the negative impact brought by the injected noise, we enhance the proposed algorithm by adding an adaptive noise tuning step, which changes the volume of added noise according to the testing accuracy. Through extensive experimental results, we verify that the proposed algorithm can achieve a better privacy level while producing high-quality samples compared with a benchmark DP-GAN scheme based on noise perturbation on training gradients.
Translated: 2022-11-13 13:20:08 Published: 2020-07-04
# Discovering Drug-Drug and Drug-Disease Interactions Inducing Acute Kidney Injury Using Deep Rule Forests

Discovering Drug-Drug and Drug-Disease Interactions Inducing Acute Kidney Injury Using Deep Rule Forests ( http://arxiv.org/abs/2007.02103v1 )

License: see the linked page
Patients with Acute Kidney Injury (AKI) have increased mortality, morbidity, and long-term adverse events. Therefore, early identification of AKI may improve renal function recovery, decrease comorbidities, and further improve patients' survival. Controlling certain risk factors and developing targeted prevention strategies are important to reduce the risk of AKI. Drug-drug interactions and drug-disease interactions are critical issues for AKI. Typical statistical approaches cannot handle the complexity of drug-drug and drug-disease interactions. In this paper, we propose a novel learning algorithm, Deep Rule Forests (DRF), which discovers rules from multilayer tree models as the combinations of drug usages and disease indications to help identify such interactions. We found that several diseases and drug usages have a significant impact on the occurrence of AKI. Our experimental results also show that the DRF model performs comparatively better than typical tree-based and other state-of-the-art algorithms in terms of prediction accuracy and model interpretability.
Translated: 2022-11-13 13:19:43 Published: 2020-07-04
# Scalable Differentiable Physics for Learning and Control

Scalable Differentiable Physics for Learning and Control ( http://arxiv.org/abs/2007.02168v1 )

License: see the linked page
Differentiable physics is a powerful approach to learning and control problems that involve physical objects and environments. While notable progress has been made, the capabilities of differentiable physics solvers remain limited. We develop a scalable framework for differentiable physics that can support a large number of objects and their interactions. To accommodate objects with arbitrary geometry and topology, we adopt meshes as our representation and leverage the sparsity of contacts for scalable differentiable collision handling. Collisions are resolved in localized regions to minimize the number of optimization variables even when the number of simulated objects is high. We further accelerate implicit differentiation of optimization with nonlinear constraints. Experiments demonstrate that the presented framework requires up to two orders of magnitude less memory and computation in comparison to recent particle-based methods. We further validate the approach on inverse problems and control scenarios, where it outperforms derivative-free and model-free baselines by at least an order of magnitude.
Translated: 2022-11-13 13:18:55 Published: 2020-07-04
# Coronavirus Knowledge Graph: A Case Study

Coronavirus Knowledge Graph: A Case Study ( http://arxiv.org/abs/2007.10287v1 )

License: see the linked page
The emergence of the novel COVID-19 pandemic has had a significant impact on global healthcare and the economy over the past few months. The virus's rapid spread has led to a proliferation in biomedical research addressing the pandemic and its related topics. One of the essential Knowledge Discovery tools that could help the biomedical research community understand and eventually find a cure for COVID-19 is the Knowledge Graph. The CORD-19 dataset is a collection of publicly available full-text research articles that have been recently published on COVID-19 and coronavirus topics. Here, we use several Machine Learning, Deep Learning, and Knowledge Graph construction and mining techniques to formalize and extract insights from the PubMed dataset and the CORD-19 dataset to identify COVID-19 related experts and bio-entities. In addition, we suggest possible techniques to predict related diseases, drug candidates, genes, gene mutations, and related compounds as part of a systematic effort to apply Knowledge Discovery methods to help biomedical researchers tackle the pandemic.
Translated: 2022-11-13 13:11:57 Published: 2020-07-04
# Lale: Consistent Automated Machine Learning

Lale: Consistent Automated Machine Learning ( http://arxiv.org/abs/2007.01977v1 )

License: see the linked page
Automated machine learning makes it easier for data scientists to develop pipelines by searching over possible choices for hyperparameters, algorithms, and even pipeline topologies. Unfortunately, the syntax for automated machine learning tools is inconsistent with manual machine learning, with each other, and with error checks. Furthermore, few tools support advanced features such as topology search or higher-order operators. This paper introduces Lale, a library of high-level Python interfaces that simplifies and unifies automated machine learning in a consistent way.
Translated: 2022-11-13 13:11:40 Published: 2020-07-04
# A Novel Multi-Step Finite-State Automaton for Arbitrarily Deterministic Tsetlin Machine Learning

A Novel Multi-Step Finite-State Automaton for Arbitrarily Deterministic Tsetlin Machine Learning ( http://arxiv.org/abs/2007.02114v1 )

License: see the linked page
Due to the high energy consumption and scalability challenges of deep learning, there is a critical need to shift research focus towards dealing with energy consumption constraints. Tsetlin Machines (TMs) are a recent approach to machine learning that has demonstrated significantly reduced energy usage compared to neural networks, while performing competitively accuracy-wise on several benchmarks. However, TMs rely heavily on energy-costly random number generation to stochastically guide a team of Tsetlin Automata to a Nash Equilibrium of the TM game. In this paper, we propose a novel finite-state learning automaton that can replace the Tsetlin Automata in TM learning, for increased determinism. The new automaton uses multi-step deterministic state jumps to reinforce sub-patterns. Simultaneously, flipping a coin to skip every $d$'th state update ensures diversification by randomization. The $d$-parameter thus allows the degree of randomization to be finely controlled. For example, $d=1$ makes every update random and $d=\infty$ makes the automaton completely deterministic. Our empirical results show that, overall, only substantial degrees of determinism reduce accuracy. Energy-wise, random number generation constitutes switching energy consumption of the TM, so reducing it saves up to 11 mW of power for larger datasets with high $d$ values. We can thus use the new $d$-parameter to trade off accuracy against energy consumption, to facilitate low-energy machine learning.
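The role of the $d$-parameter can be illustrated with a small Python sketch. This is a simplified reading of the abstract, not the paper's automaton: the class name, the fair coin, the fixed jump size, and the two-action state layout are all assumptions made for illustration. Only every $d$-th update consults the random number generator, so larger $d$ means fewer random draws.

```python
import math
import random


class SkippingAutomaton:
    """Two-action finite-state automaton; every d-th update is skipped on a coin flip."""

    def __init__(self, n_states_per_action=100, d=math.inf, jump=1, seed=0):
        self.n = n_states_per_action  # states 1..n select action 0, n+1..2n action 1
        self.state = self.n           # start at the boundary, on the action-0 side
        self.d = d                    # d = 1: every update is randomized; d = inf: none
        self.jump = jump              # deterministic jump size (assumed fixed here)
        self.updates_seen = 0
        self.rng = random.Random(seed)

    def action(self):
        return 1 if self.state > self.n else 0

    def _skip(self):
        self.updates_seen += 1
        if self.d == math.inf or self.updates_seen % self.d != 0:
            return False              # deterministic update, no random draw needed
        return self.rng.random() < 0.5

    def update(self, reward):
        if self._skip():
            return
        # Reward pushes deeper into the current action; penalty pushes toward the other.
        toward_action_1 = (self.action() == 1) == reward
        step = self.jump if toward_action_1 else -self.jump
        self.state = min(2 * self.n, max(1, self.state + step))


deterministic = SkippingAutomaton(n_states_per_action=10, d=math.inf)
for _ in range(3):
    deterministic.update(reward=True)   # reinforces action 0
for _ in range(4):
    deterministic.update(reward=False)  # penalties push across the boundary
```

With `d=math.inf` the trajectory is fully reproducible without any random draws; with `d=1` each update is applied or skipped with equal probability.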
Translated: 2022-11-13 13:11:32 Published: 2020-07-04
# Playing Chess with Limited Look Ahead

Playing Chess with Limited Look Ahead ( http://arxiv.org/abs/2007.02130v1 )

License: see the linked page
We have seen numerous machine learning methods tackle the game of chess over the years. However, one common element in these works is the necessity of a finely optimized look ahead algorithm. The particular interest of this research lies with creating a chess engine that is highly capable, but restricted in its look ahead depth. We train a deep neural network to serve as a static evaluation function, which is accompanied by a relatively simple look ahead algorithm. We show that our static evaluation function has encoded some semblance of look ahead knowledge, and is comparable to classical evaluation functions. The strength of our chess engine is assessed by comparing its proposed moves against those proposed by Stockfish. We show that, despite strict restrictions on look ahead depth, our engine recommends moves of equal strength in roughly 83% of our sample positions.
Translated: 2022-11-13 13:11:10 Published: 2020-07-04
# Building a Competitive Associative Classifier

Building a Competitive Associative Classifier ( http://arxiv.org/abs/2007.01972v1 )

License: see the linked page
With the huge success of deep learning, other machine learning paradigms have had to take a back seat. Yet other models, particularly rule-based ones, are more readable and explainable and can even be competitive when labelled data is not abundant. However, most of the existing rule-based classifiers suffer from the production of a large number of classification rules, affecting the model readability. This hampers the classification accuracy, as noisy rules might not add any useful information for classification and also lead to longer classification time. In this study, we propose SigD2, which uses a novel two-stage pruning strategy that prunes most of the noisy, redundant and uninteresting rules and makes the classification model more accurate and readable. To make SigDirect more competitive with the most prevalent but uninterpretable machine learning-based classifiers like neural networks and support vector machines, we propose bagging and boosting on the ensemble of the SigDirect classifier. The results of the proposed algorithms are quite promising and we are able to obtain a minimal set of statistically significant rules for classification without jeopardizing the classification accuracy. We use 15 UCI datasets and compare our approach with eight existing systems. The SigD2 and boosted SigDirect (ACboost) ensemble models outperform various state-of-the-art classifiers not only in terms of classification accuracy but also in terms of the number of rules.
Translated: 2022-11-13 13:10:56 Published: 2020-07-04
# Nested Subspace Arrangement for Representation of Relational Data

Nested Subspace Arrangement for Representation of Relational Data ( http://arxiv.org/abs/2007.02007v1 )

License: see the linked page
Studies on acquiring appropriate continuous representations of discrete objects, such as graphs and knowledge base data, have been conducted by many researchers in the field of machine learning. In this study, we introduce Nested SubSpace (NSS) arrangement, a comprehensive framework for representation learning. We show that existing embedding techniques can be regarded as special cases of the NSS arrangement. Based on the concept of the NSS arrangement, we implement Disk-ANChor ARrangement (DANCAR), a representation learning method specialized for reproducing general graphs. Numerical experiments show that DANCAR successfully embeds WordNet in ${\mathbb R}^{20}$ with an F1 score of 0.993 in the reconstruction task. DANCAR is also suitable for visualization in understanding the characteristics of graphs.
Translated: 2022-11-13 13:10:31 Published: 2020-07-04
# Simple and Deep Graph Convolutional Networks

Simple and Deep Graph Convolutional Networks ( http://arxiv.org/abs/2007.02133v1 )

License: see the linked page
Graph convolutional networks (GCNs) are a powerful deep learning approach for graph-structured data. Recently, GCNs and subsequent variants have shown superior performance in various application areas on real-world datasets. Despite their success, most of the current GCN models are shallow, due to the *over-smoothing* problem. In this paper, we study the problem of designing and analyzing deep graph convolutional networks. We propose the GCNII, an extension of the vanilla GCN model with two simple yet effective techniques: *Initial residual* and *Identity mapping*. We provide theoretical and empirical evidence that the two techniques effectively relieve the problem of over-smoothing. Our experiments show that the deep GCNII model outperforms the state-of-the-art methods on various semi- and full-supervised tasks. Code is available at https://github.com/chennnM/GCNII .
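The abstract names the two techniques without giving formulas. The NumPy sketch below shows one layer in the form commonly cited for GCNII, $H^{(l+1)} = \sigma\big(((1-\alpha)\tilde{P}H^{(l)} + \alpha H^{(0)})((1-\beta)I + \beta W^{(l)})\big)$; treat this form, the function names, and the choice of ReLU as assumptions to be checked against the paper and the linked repository.

```python
import numpy as np


def normalized_adjacency(A):
    """Symmetrically normalized adjacency with self-loops: D^{-1/2} (A + I) D^{-1/2}."""
    A_tilde = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_tilde.sum(axis=1))
    return A_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcnii_layer(P, H, H0, W, alpha, beta):
    """One layer with an initial residual (alpha) and an identity mapping (beta)."""
    # Initial residual: mix the propagated features with the first-layer features H0.
    support = (1 - alpha) * (P @ H) + alpha * H0
    # Identity mapping: (1 - beta) * I + beta * W applied to the mixed features.
    out = (1 - beta) * support + beta * (support @ W)
    return np.maximum(out, 0.0)  # ReLU


A = np.array([[0.0, 1.0], [1.0, 0.0]])
P = normalized_adjacency(A)
H0 = np.array([[1.0, -2.0], [3.0, 4.0]])
W = np.eye(2)
H1 = gcnii_layer(P, H0, H0, W, alpha=0.1, beta=0.5)
```

Setting `alpha=0` and `beta=1` recovers a plain GCN layer, which makes the two additions easy to ablate.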
Translated: 2022-11-13 13:10:21 Published: 2020-07-04
# Variational Policy Gradient Method for Reinforcement Learning with General Utilities

Variational Policy Gradient Method for Reinforcement Learning with General Utilities ( http://arxiv.org/abs/2007.02151v1 )

License: see the linked page
In recent years, reinforcement learning (RL) systems with general goals beyond a cumulative sum of rewards have gained traction, such as in constrained problems, exploration, and acting upon prior experiences. In this paper, we consider policy optimization in Markov Decision Problems, where the objective is a general concave utility function of the state-action occupancy measure, which subsumes several of the aforementioned examples as special cases. Such generality invalidates the Bellman equation. As this means that dynamic programming no longer works, we focus on direct policy search. Analogously to the Policy Gradient Theorem \cite{sutton2000policy} available for RL with cumulative rewards, we derive a new Variational Policy Gradient Theorem for RL with general utilities, which establishes that the parametrized policy gradient may be obtained as the solution of a stochastic saddle point problem involving the Fenchel dual of the utility function. We develop a variational Monte Carlo gradient estimation algorithm to compute the policy gradient based on sample paths. We prove that the variational policy gradient scheme converges globally to the optimal policy for the general objective, though the optimization problem is nonconvex. We also establish its rate of convergence of the order $O(1/t)$ by exploiting the hidden convexity of the problem, and prove that it converges exponentially when the problem admits hidden strong convexity. Our analysis applies to the standard RL problem with cumulative rewards as a special case, in which case our result improves the available convergence rate.
Translated: 2022-11-13 13:10:08 Published: 2020-07-04
# On Connections between Regularizations for Improving DNN Robustness

On Connections between Regularizations for Improving DNN Robustness ( http://arxiv.org/abs/2007.02209v1 )

License: see the linked page
This paper analyzes regularization terms proposed recently for improving the adversarial robustness of deep neural networks (DNNs), from a theoretical point of view. Specifically, we study possible connections between several effective methods, including input-gradient regularization, Jacobian regularization, curvature regularization, and a cross-Lipschitz functional. We investigate them on DNNs with general rectified linear activations, which constitute one of the most prevalent families of models for image classification and a host of other machine learning applications. We shed light on essential ingredients of these regularizations and re-interpret their functionality. Through the lens of our study, more principled and efficient regularizations can possibly be invented in the near future.
Translated: 2022-11-13 13:03:10 Published: 2020-07-04
# Discount Factor as a Regularizer in Reinforcement Learning

Discount Factor as a Regularizer in Reinforcement Learning ( http://arxiv.org/abs/2007.02040v1 )

License: see the linked page
Specifying a Reinforcement Learning (RL) task involves choosing a suitable planning horizon, which is typically modeled by a discount factor. It is known that applying RL algorithms with a lower discount factor can act as a regularizer, improving performance in the limited data regime. Yet the exact nature of this regularizer has not been investigated. In this work, we fill in this gap. For several Temporal-Difference (TD) learning methods, we show an explicit equivalence between using a reduced discount factor and adding an explicit regularization term to the algorithm's loss. Motivated by the equivalence, we empirically study this technique compared to standard $L_2$ regularization by extensive experiments in discrete and continuous domains, using tabular and functional representations. Our experiments suggest the regularization effectiveness is strongly related to properties of the available data, such as size, distribution, and mixing rate.
Translated: 2022-11-13 13:02:23 Published: 2020-07-04
# Neuro-Symbolic Generative Art: A Preliminary Study

Neuro-Symbolic Generative Art: A Preliminary Study ( http://arxiv.org/abs/2007.02171v1 )

License: see the linked page
There are two classes of generative art approaches: neural, where a deep model is trained to generate samples from a data distribution, and symbolic or algorithmic, where an artist designs the primary parameters and an autonomous system generates samples within these constraints. In this work, we propose a new hybrid genre: neuro-symbolic generative art. As a preliminary study, we train a generative deep neural network on samples from the symbolic approach. We demonstrate through human studies that subjects find the final artifacts and the creation process using our neuro-symbolic approach to be more creative than the symbolic approach 61% and 82% of the time respectively.
Translated: 2022-11-13 13:01:57 Published: 2020-07-04
# DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths

DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths ( http://arxiv.org/abs/2007.02010v1 )

License: see the linked page
Over-parameterization is ubiquitous nowadays in training neural networks, benefiting both optimization in seeking global optima and generalization in reducing prediction error. However, compressive networks are desired in many real-world applications, and direct training of small networks may be trapped in local optima. In this paper, instead of pruning or distilling over-parameterized models to compressive ones, we propose a new approach based on differential inclusions of inverse scale spaces. Specifically, it generates a family of models from simple to complex ones by coupling a pair of parameters to simultaneously train over-parameterized deep models and structural sparsity on weights of fully connected and convolutional layers. Such a differential inclusion scheme has a simple discretization, proposed as Deep structurally splitting Linearized Bregman Iteration (DessiLBI), for which we establish a global convergence analysis in deep learning: from any initialization, the algorithmic iterates converge to a critical point of the empirical risk. Experimental evidence shows that DessiLBI achieves comparable and even better performance than competitive optimizers in exploring the structural sparsity of several widely used backbones on the benchmark datasets. Remarkably, with early stopping, DessiLBI unveils "winning tickets" in early epochs: the effective sparse structure with comparable test accuracy to fully trained over-parameterized models.
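To make the idea of a coupled pair of parameters concrete, here is a minimal sketch of a split linearized Bregman-style iteration on a toy quadratic loss. It assumes an augmented loss `L(w) + ||w - gamma||^2 / (2 * nu)` and an elementwise soft-threshold as the proximal map. The abstract does not give the update rule, so the specific form below, the hyperparameters `kappa`, `alpha`, `nu`, and the toy target are assumptions for illustration. For convolutional layers the paper targets structural sparsity, which this elementwise version does not model.

```python
def soft_threshold(v, lam=1.0):
    """Elementwise proximal map of the L1 norm with threshold lam."""
    if v > lam:
        return v - lam
    if v < -lam:
        return v + lam
    return 0.0

def split_lbi(target, kappa=5.0, alpha=0.02, nu=1.0, steps=300):
    """Toy split linearized Bregman iteration.

    Loss: 0.5 * ||w - target||^2, coupled to a sparse copy gamma through
    ||w - gamma||^2 / (2 * nu). Returns the dense weights w and sparse gamma.
    """
    n = len(target)
    w = [0.0] * n      # dense, over-parameterized weights
    v = [0.0] * n      # auxiliary variable accumulating gradient history
    gamma = [0.0] * n  # sparse structural parameter
    for _ in range(steps):
        # Gradients of the augmented loss at the current (w, gamma).
        grad_w = [(w[i] - target[i]) + (w[i] - gamma[i]) / nu for i in range(n)]
        grad_gamma = [(gamma[i] - w[i]) / nu for i in range(n)]
        w = [w[i] - kappa * alpha * grad_w[i] for i in range(n)]
        v = [v[i] - alpha * grad_gamma[i] for i in range(n)]
        gamma = [kappa * soft_threshold(v[i]) for i in range(n)]
    return w, gamma

# One strong coordinate, one zero coordinate, one weak coordinate.
w, gamma = split_lbi([3.0, 0.0, 0.1])
```

With these settings the strong coordinate enters the support of `gamma` early and approaches its target. The weak coordinate is still zero in `gamma` after 300 steps, although the dense `w` already assigns it a nonzero value. Stopping early thus yields a sparse model, while longer runs would gradually admit weaker coordinates, giving a path from simple to complex models.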
Translated: 2022-11-13 13:00:57 Published: 2020-07-04