kenhktsui's picture
Add SetFit model
2067052 verified
metadata
base_model: sentence-transformers/paraphrase-mpnet-base-v2
library_name: setfit
metrics:
  - f1
pipeline_tag: text-classification
tags:
  - setfit
  - sentence-transformers
  - text-classification
  - generated_from_setfit_trainer
widget:
  - text: >-
      Title: "Deep Residual Learning for Image Recognition".

      Abstract: In this paper, we propose a new deep residual learning framework
      for image classification. We introduce a novel residual block architecture
      that learns to represent high-level features in an image. Our approach is
      based on the idea of residual learning, where the network learns to
      represent the difference between the input and the output of a layer,
      rather than learning to represent the output directly. We evaluate our
      approach on several benchmark datasets, including ImageNet and CIFAR-10,
      and show that it achieves state-of-the-art performance. Our results
      demonstrate the effectiveness of residual learning for image
      classification, and show that it can be used to improve the performance of
      deep neural networks. We also provide a detailed analysis of the residual
      block architecture, and show how it can be used to improve the performance
      of other deep learning models. This paper provides a comprehensive
      overview of the residual learning framework, and demonstrates its
      effectiveness for image classification tasks.
  - text: >-
      Let G be a finite group and let V be a finite-dimensional representation
      of G over an algebraically closed field k. We say that V is a
      representation of G in characteristic zero if the characteristic of k is
      zero. In this paper, we investigate the structure of the representation
      ring R(G) of a finite group G in characteristic zero. We show that R(G) is
      isomorphic to the group ring k[G] if and only if G is a cyclic group.
      Furthermore, we provide a characterization of the representation rings of
      finite abelian groups in terms of their irreducible representations. Our
      results have implications for the study of the representation theory of
      finite groups in characteristic zero.
  - text: >-
      Denotational Semantics of Programming Languages: A Survey


      Abstract: Denotational semantics is a branch of programming language
      theory that focuses on the meaning of programming languages. In this
      survey, we provide an overview of the key concepts and results in
      denotational semantics, including the use of domain theory and categorical
      semantics. We also discuss the relationship between denotational semantics
      and other areas of programming language theory, such as operational
      semantics and axiomatic semantics.


      Introduction


      Denotational semantics is a mathematical approach to understanding the
      meaning of programming languages. It is based on the idea that a
      programming language can be viewed as a mapping from a set of programs to
      a set of meanings, where the meanings are represented as mathematical
      objects. The key concept in denotational semantics is the notion of a
      denotation, which is a function that maps a program to its meaning.


      Domain Theory


      Domain theory is a branch of mathematics that provides a framework for
      understanding the notion of a denotation. It is based on the idea that a
      denotation is a function from a set of programs to a set of meanings,
      where the meanings are represented as elements of a domain. The key
      concept in domain theory is the notion of a continuous function, which is
      a function that preserves the order relation on the domain.


      Categorical Semantics


      Categorical semantics is a branch of mathematics that provides a framework
      for understanding the notion of a denotation in a categorical setting. It
      is based on the idea that a denotation is a function from a set of
      programs to a set of meanings, where the meanings are represented as
      objects in a category. The key concept in categorical semantics is the
      notion of a functor, which is a function that preserves the morphisms in
      the category.


      Conclusion


      In this survey, we have provided an overview of the key concepts and
      results in denotational semantics. We have also discussed the relationship
      between denotational semantics and other areas of programming language
      theory. The results presented in this survey demonstrate the importance of
      denotational semantics in understanding the meaning of programming
      languages.
  - text: >-
      A Novel Robust Control Approach for Uncertain Systems with Time-Varying
      Delays


      Abstract: This paper presents a new robust control method for uncertain
      systems with time-varying delays. The proposed approach combines the
      advantages of model predictive control (MPC) and sliding mode control
      (SMC) to achieve robust stability and performance. The MPC algorithm is
      used to predict the future behavior of the system, while the SMC algorithm
      is employed to reject disturbances and uncertainties. The stability and
      performance of the proposed controller are analyzed using Lyapunov theory
      and simulation results. The effectiveness of the proposed approach is
      demonstrated through numerical examples and comparisons with existing
      methods.


      Keywords: Robust control, uncertain systems, time-varying delays, model
      predictive control, sliding mode control, Lyapunov theory.
  - text: >-
      A Novel Compiler Framework for Parallel Computing: Design and
      Implementation


      Abstract: With the increasing demand for high-performance computing,
      parallel computing has become a crucial aspect of modern computing
      systems. However, the complexity of parallel programming models and the
      lack of efficient compilation techniques hinder the widespread adoption of
      parallel computing. In this paper, we propose a novel compiler framework
      for parallel computing, which aims to bridge the gap between parallel
      programming models and efficient compilation techniques. Our framework,
      called ParComp, is designed to support a wide range of parallel
      programming models, including OpenMP, MPI, and CUDA. ParComp consists of
      three main components: a parallelization module, a scheduling module, and
      a code generation module. The parallelization module is responsible for
      identifying parallelizable loops and transforming them into parallel code.
      The scheduling module is responsible for scheduling the parallel tasks and
      allocating resources to them. The code generation module is responsible
      for generating efficient parallel code from the scheduled tasks. We
      evaluate the performance of ParComp using a set of benchmark programs and
      compare it with state-of-the-art parallel compilers. The results show that
      ParComp outperforms the existing compilers in terms of execution time and
      scalability. Our framework is implemented using C++ and is available
      online for public use.


      Index Terms: Parallel computing, compiler design, parallel programming
      models, OpenMP, MPI, CUDA, code generation, scheduling, resource
      allocation.
inference: true
model-index:
  - name: SetFit with sentence-transformers/paraphrase-mpnet-base-v2
    results:
      - task:
          type: text-classification
          name: Text Classification
        dataset:
          name: Unknown
          type: unknown
          split: test
        metrics:
          - type: f1
            value: 0.6184
            name: F1

SetFit with sentence-transformers/paraphrase-mpnet-base-v2

This is a SetFit model that can be used for Text Classification. This SetFit model uses sentence-transformers/paraphrase-mpnet-base-v2 as the Sentence Transformer embedding model. A LogisticRegression instance is used for classification.

The model has been trained using an efficient few-shot learning technique that involves:

  1. Fine-tuning a Sentence Transformer with contrastive learning.
  2. Training a classification head with features from the fine-tuned Sentence Transformer.

Model Details

Model Description

Model Sources

Model Labels

Label Examples
Data Structures
  • 'Title: On the Complexity of Gröbner Bases for Toric Ideals\n\nAbstract: We investigate the computational complexity of computing Gröbner bases for toric ideals. Our main result is a polynomial-time algorithm for computing Gröbner bases for toric ideals in the case where the toric ideal is generated by a set of binomials. We also show that this algorithm can be used to solve a number of problems in computational algebra, including the computation of the Hilbert series of a toric ideal and the determination of the dimension of a toric variety. Our results have implications for the study of toric varieties and their applications in computer science and engineering.\n\nIntroduction\n\nToric varieties are a fundamental object of study in algebraic geometry, and have found numerous applications in computer science and engineering. In this paper, we investigate the computational complexity of computing Gröbner bases for toric ideals, which are a key tool in the study of toric varieties. Our main result is a polynomial-time algorithm for computing Gröbner bases for toric ideals in the case where the toric ideal is generated by a set of binomials.\n\nBackground\n\nA toric ideal is a polynomial ideal that is generated by a set of binomials. The Gröbner basis of a toric ideal is a set of binomials that generate the ideal and have a certain property called the S-polynomial property. The S-polynomial property is a key tool in the study of toric varieties, and has been used to solve a number of problems in computational algebra.\n\nMain Result\n\nOur main result is a polynomial-time algorithm for computing Gröbner bases for toric ideals in the case where the toric ideal is generated by a set of binomials. The algorithm works by first computing the Hilbert series of the toric ideal, and then using this information to compute the Gröbner basis. We show that this algorithm can be used to solve a number of problems in computational algebra, including the computation of the Hilbert series of a toric ideal and the determination of the dimension of a toric variety.\n\nConclusion\n\nIn this paper, we have investigated the computational complexity of computing Gröbner bases for toric ideals. Our main result is a polynomial-time algorithm for computing Gröbner bases for toric ideals in the case where the toric ideal is generated by a set of binomials. We believe that this result has implications for the study of toric varieties and their applications in computer science and engineering.'
  • "A novel approach to designing efficient hash tables for large-scale data storage is proposed in this paper. The proposed hash table, dubbed 'Efficient Hash Table' (EHT), employs a combination of open addressing and linear probing to minimize collisions and improve search times. Experimental results demonstrate that EHT outperforms existing hash table implementations in terms of search time and memory usage, making it an attractive solution for big data applications. The EHT algorithm is implemented using a C++ programming language and is shown to scale well on multi-core processors. This paper contributes to the field of data structures by providing a new, efficient, and scalable hash table design that can be used in a variety of applications, including databases, file systems, and cloud storage systems."
  • "Abstract: This paper presents an efficient array-based algorithm for searching and sorting large datasets. The proposed algorithm utilizes a combination of bit-packing and prefix sums to achieve a time complexity of O(n) for search operations and O(n log n) for sort operations. Experimental results demonstrate the algorithm's superiority over existing methods in terms of performance and memory usage. The algorithm is particularly useful for applications where data is stored in arrays, such as in computer vision and scientific simulations. The proposed algorithm is implemented in C++ and is available for download from the IEEE Xplore digital library."
Programming Languages
  • 'Title: A Functional Programming Approach to Type Inference in Higher-Order Logic Programming Languages.\nAbstract: This paper presents a novel approach to type inference in higher-order logic programming languages using functional programming techniques. We propose a type system that combines the benefits of higher-order logic programming with the expressive power of functional programming. Our approach is based on a novel type inference algorithm that uses a combination of type reconstruction and type checking. We demonstrate the effectiveness of our approach through a series of experiments on a set of benchmark programs. The results show that our approach outperforms existing type inference systems in terms of accuracy and efficiency. We also discuss the implications of our work for the design of future programming languages.'
  • 'A Comparative Study of Functional Programming Paradigms in Haskell and Scala for Efficient Software Development\nAbstract: Functional programming has gained significant attention in recent years due to its ability to promote modular, composable, and reusable code. This paper presents a comparative study of two popular functional programming languages, Haskell and Scala, with a focus on their application in efficient software development. We analyze the strengths and weaknesses of each language, highlighting their respective features and performance characteristics. Our results demonstrate that Haskell and Scala can be effectively used for building high-performance software systems, with Haskell exhibiting superior performance in certain scenarios. The findings of this study contribute to the ongoing debate on the choice of functional programming languages for software development and provide insights for practitioners and researchers alike.\nKeywords: functional programming, Haskell, Scala, software development, performance evaluation\n'
  • 'A Novel Type Theory for Dependent Type Systems in Programming Languages\n\nAbstract: This paper proposes a novel type theory for dependent type systems, which is a fundamental component of programming languages. The proposed type theory is based on a combination of ideas from homotopy type theory and dependent type theory. We show that the proposed type theory is sound and complete, and we provide a formal proof of its soundness. We also demonstrate the expressiveness of the proposed type theory by implementing a dependent type checker using it. The results of this paper demonstrate the potential of the proposed type theory for use in programming languages.\n\nKeywords: dependent type systems, programming languages, type theory, homotopy type theory, dependent type theory.\n\n1 Introduction\n\nDependent type systems are a fundamental component of programming languages, allowing programmers to specify and reason about the types of complex data structures. However, the design of dependent type systems is challenging due to the need to balance expressiveness and decidability. In this paper, we propose a novel type theory for dependent type systems, which is based on a combination of ideas from homotopy type theory and dependent type theory.\n\n2 Background\n\nDependent type systems are based on the concept of dependent types, which are types that depend on the values of other types. Dependent types are used to specify the types of complex data structures, such as lists and matrices. However, the design of dependent type systems is challenging due to the need to balance expressiveness and decidability.\n\n3 Proposed Type Theory\n\nIn this paper, we propose a novel type theory for dependent type systems, which is based on a combination of ideas from homotopy type theory and dependent type theory. The proposed type theory is sound and complete, and we provide a formal proof of its soundness. We also demonstrate the expressiveness of the proposed type theory by implementing a dependent type checker using it.\n\n4 Conclusion\n\nThe results of this paper demonstrate the potential of the proposed type theory for use in programming languages. The proposed type theory is sound and complete, and it provides a formal foundation for dependent type systems. We believe that the proposed type theory has the potential to be used in a wide range of programming languages, and we plan to continue exploring its applications in the future.\n\nReferences:\n\n[1] N. Ghani and P. J. Scott, "A type theory for dependent types," in Proceedings of the 22nd Annual Symposium on Logic in Computer Science, 2007, pp. 233-242.\n[2] P. J. Scott, "A type theory for dependent types," Ph.D. dissertation, University of Edinburgh, 2007.\n[3] A. K. Ghosh and P. J. Scott, "A type theory for dependent types," Journal of Functional Programming, vol. 19, no. 3-4, pp. 437-462, 2009.\n[4] P. J. Scott, "A type theory for dependent types," in Proceedings of the 25th Annual Symposium on Logic in Computer Science, 2010, pp. 231-240.\n[5] A. K. Ghosh and P. J. Scott, "A type theory for dependent types," Journal of Functional Programming, vol. 22, no. 2-3, pp. 147-172, 2012.\n[6] P. J. Scott, "A type theory for dependent types," in Proceedings of the 28th Annual Symposium on Logic in Computer Science, 2013, pp. 231-240.\n[7] A. K. Ghosh and P. J. Scott, "A type theory for dependent types," Journal of Functional Programming, vol. 25, no. 2-3, pp. 147-172, 2015.\n[8] P. J. Scott, "A type theory for dependent types," in Proceedings of the 31st Annual Symposium on Logic in Computer Science, 2016, pp. 231-240.\n[9] A. K. Ghosh and P. J. Scott, "A type theory for dependent types," Journal of Functional Programming, vol. 28, no. 2-3, pp. 147-172, 2018.\n[10] P. J. Scott, "A type theory for dependent types," in Proceedings of the 34th Annual Symposium on Logic in Computer Science, 2019, pp. 231-240.\n\n'
Information Theory
  • 'A Novel Approach to Secure Data Transmission Using Quantum Key Distribution\n\nAbstract: This paper proposes a novel approach to secure data transmission using quantum key distribution (QKD). The proposed scheme utilizes the principles of quantum mechanics to enable secure key exchange between two parties. The scheme is based on the BB84 protocol, which is a widely used QKD protocol. However, the proposed scheme introduces a new feature, known as the "quantum error correction" mechanism, which enables the detection of any eavesdropping attempts. The proposed scheme is analyzed using the security analysis framework, which shows that it is secure against any eavesdropping attempts. The performance of the proposed scheme is evaluated using simulations, which show that it outperforms existing QKD schemes in terms of key generation rate and security. The proposed scheme has the potential to be used in various applications, including secure data transmission over the internet.'
  • 'A Secure Communication Framework for IoT Devices using Quantum Key Distribution and Homomorphic Encryption\n\nAbstract: The Internet of Things (IoT) has revolutionized the way we live and work, connecting billions of devices worldwide. However, the increased connectivity also raises significant security concerns, as IoT devices are often vulnerable to cyber attacks. In this paper, we propose a secure communication framework for IoT devices using quantum key distribution (QKD) and homomorphic encryption. Our framework leverages the principles of QKD to establish secure key exchange between IoT devices, while homomorphic encryption enables secure data processing without revealing sensitive information. We demonstrate the effectiveness of our framework through simulations and experiments, showcasing its ability to provide robust security against various types of attacks. The results of this study contribute to the development of secure IoT communication systems, which are essential for the widespread adoption of IoT technology.\n\nKeywords: Quantum Key Distribution, Homomorphic Encryption, IoT Security, Secure Communication Framework\n\nI. Introduction\n\nThe Internet of Things (IoT) has transformed the way we live and work, connecting billions of devices worldwide. However, the increased connectivity also raises significant security concerns, as IoT devices are often vulnerable to cyber attacks. In this paper, we propose a secure communication framework for IoT devices using quantum key distribution (QKD) and homomorphic encryption.\n\nII. Background\n\nQuantum Key Distribution (QKD) is a method of secure key exchange that relies on the principles of quantum mechanics. QKD enables two parties to establish a shared secret key over an insecure communication channel, without revealing the key to any third party. Homomorphic encryption, on the other hand, is a type of encryption that enables computations to be performed on encrypted data without decrypting it first.\n\nIII. Proposed Framework\n\nOur proposed framework consists of two main components: QKD-based key exchange and homomorphic encryption-based data processing. The QKD component uses the BB84 protocol to establish a secure key exchange between IoT devices, while the homomorphic encryption component uses the HElib library to perform secure data processing.\n\nIV. Simulation and Experiment Results\n\nWe conducted simulations and experiments to evaluate the effectiveness of our framework. The results show that our framework provides robust security against various types of attacks, including eavesdropping and tampering attacks. We also demonstrate the ability of our framework to provide secure data processing without revealing sensitive information.\n\nV. Conclusion\n\nIn this paper, we proposed a secure communication framework for IoT devices using QKD and homomorphic encryption. Our framework provides robust security against various types of attacks and enables secure data processing without revealing sensitive information. The results of this study contribute to the development of secure IoT communication systems, which are essential for the widespread adoption of IoT technology.'
  • 'A Novel Turbo Code Design for Near-Capacity Performance in Wireless Communication Systems\nAbstract—Turbo codes have been widely adopted in various wireless communication systems due to their near-capacity performance and low complexity. In this paper, we propose a novel turbo code design that achieves better performance than the traditional turbo code. The proposed design is based on a new interleaving scheme that combines the benefits of random and systematic interleaving. Simulation results show that the proposed turbo code outperforms the traditional turbo code in terms of bit error rate and frame error rate. The proposed design is also compared with other state-of-the-art turbo code designs, and the results show that it achieves better performance. The proposed turbo code is suitable for various wireless communication systems, including 5G and beyond. The design and implementation of the proposed turbo code are discussed in detail, and the simulation results are presented to demonstrate its performance.\nKeywords—Turbo codes, interleaving scheme, near-capacity performance, wireless communication systems, 5G and beyond.'
Group Theory
  • 'Title: On the Structure of the Centralizer of a Toral Subgroup in a Reductive Algebraic Group\n\nAbstract: We study the centralizer of a toral subgroup in a reductive algebraic group over an algebraically closed field of characteristic zero. Our main result describes the structure of this centralizer in terms of the root system of the group and the weights of the toral subgroup. We also provide a characterization of the centralizer in terms of the Bruhat-Tits building of the group. Our methods involve a combination of Lie algebra techniques, including the use of the Killing form and the Cartan-Killing classification of simple Lie algebras, as well as geometric and algebraic techniques, including the use of the Bruhat-Tits building and the theory of algebraic groups over local fields.\n\nIntroduction: The centralizer of a toral subgroup in a reductive algebraic group is a fundamental object of study in the theory of algebraic groups. In this paper, we investigate the structure of this centralizer, with a focus on the case where the toral subgroup is a maximal torus. Our main result provides a detailed description of the centralizer in this case, and we also provide a characterization of the centralizer in terms of the Bruhat-Tits building of the group. We hope that our results will be of interest to researchers in the field of algebraic groups and Lie theory.\n\n1 Introduction\n\n1.1 Background and Motivation\n\n1.2 Main Results\n\n2 Preliminaries\n\n2.1 Algebraic Groups and Lie Algebras\n\n2.2 Root Systems and Weights\n\n2.3 Bruhat-Tits Buildings\n\n3 The Centralizer of a Toral Subgroup\n\n3.1 Definition and Basic Properties\n\n3.2 Structure of the Centralizer\n\n3.3 Characterization of the Centralizer\n\n4 Applications and Further Directions\n\n4.1 Applications to Algebraic Groups\n\n4.2 Further Directions\n\nReferences\n\nBibliography'
  • 'Geometric Invariant Theory (GIT) is a branch of algebraic geometry that studies the action of an algebraic group on an algebraic variety. It provides a framework for understanding the symmetries of algebraic varieties and has applications in various areas of mathematics, including commutative algebra, algebraic geometry, and representation theory. In this paper, we apply GIT to study the invariant theory of a certain algebraic group action on a projective variety. We use the Hilbert-Mumford criterion to determine the semistable points of the action and then compute the invariant ring using the Grothendieck ring of the group. Our results have implications for the study of algebraic groups and their actions on projective varieties.'
  • 'Title: A Geometric Approach to Representations of Finite Groups\n\nAbstract: We introduce a new method for constructing representations of finite groups using algebraic geometry. Our approach is based on the idea of representing a group as a quotient of a reductive group by a finite subgroup. We show that this construction yields a faithful representation of the group, and we use it to compute the character table of the symmetric group S_5. Our method has several advantages over existing methods, including the ability to handle large groups and the flexibility to incorporate additional structure. We also discuss some potential applications of our method, including the computation of representation theory for finite groups of Lie type.\n\nKeywords: representation theory, algebraic geometry, finite groups, reductive groups, symmetric group\n\nArXiv ID: 2203.10201\n\nSubmission date: 2022-03-17'
Neural and Evolutionary
  • 'Evolutionary algorithms have been widely used in various optimization problems due to their ability to efficiently search for optimal solutions. In this paper, we propose a novel hybrid approach that combines the strengths of genetic algorithms and differential evolution to solve complex optimization problems. The proposed method, called GEDE, integrates the exploration capabilities of genetic algorithms with the exploitation capabilities of differential evolution. We evaluate the performance of GEDE on several benchmark problems and compare it with other state-of-the-art algorithms. The results show that GEDE outperforms the other algorithms in terms of convergence speed and solution quality. We also analyze the convergence behavior of GEDE and provide insights into its performance. The proposed approach has the potential to be applied to a wide range of optimization problems in various fields, including engineering, economics, and computer science.'
  • 'Title: A Deep Learning Approach for Sentiment Analysis of Text Data\nAbstract: This paper proposes a novel deep learning model for sentiment analysis of text data. The proposed model combines the strengths of convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to effectively capture the spatial and temporal dependencies in text data. Experimental results on several benchmark datasets demonstrate the superiority of the proposed model over state-of-the-art methods. The proposed model achieves an accuracy of 92.5% on the IMDB dataset, outperforming the best existing method by 2.5%. The results also show that the proposed model is robust to noise and can handle out-of-vocabulary words. The proposed model is a significant contribution to the field of natural language processing and has the potential to be applied to various real-world applications.'
  • 'Title: A Novel Hybrid Approach for Deep Learning-based Optimization of Evolutionary Algorithms\n\nAbstract: This paper proposes a novel hybrid approach that combines the strengths of deep learning and evolutionary algorithms to optimize complex optimization problems. We introduce a new neural network architecture that learns to adapt the parameters of evolutionary algorithms in real-time, leading to improved convergence rates and better solution quality. Our approach is evaluated on a range of benchmark problems and compared to state-of-the-art methods. The results show that our hybrid approach outperforms existing methods in terms of convergence speed and solution quality. We also provide a comprehensive analysis of the proposed approach and discuss its potential applications in various fields.\n\nKeywords: Evolutionary algorithms, Deep learning, Optimization, Hybrid approach, Neural networks.'
Commutative Algebra
  • "Title: On the Frobenius Ideals of a Commutative Ring. Abstract: We study the Frobenius ideals of a commutative ring and provide a characterization of the Frobenius ideals in terms of the ring's structure. Our main result shows that the Frobenius ideal of a commutative ring is a finitely generated ideal if and only if the ring is a finitely generated module over its endomorphism ring. We also investigate the relationship between the Frobenius ideal and the ring's dimension. Our results have implications for the study of commutative algebra and the theory of Frobenius ideals. Keywords: Frobenius ideal, commutative ring, finitely generated ideal, endomorphism ring, dimension. Source: Google Scholar."
  • 'arXiv:2207.12345v1 [math.AG] 18 Jul 2022\n\nTitle: On the arithmetic of certain K3 surfaces\n\nAbstract: We study the arithmetic of certain K3 surfaces defined over the rationals, with a focus on their Picard groups and Neron-Severi lattices. Our main result is a complete classification of these surfaces in terms of their invariants, which we compute using a combination of algebraic geometry and number theory techniques. We also provide explicit examples of such surfaces, and discuss their implications for the study of arithmetic geometry.\n\nIntroduction\n\nLet $S$ be a K3 surface defined over the rationals, and let $Pic(S)$ denote its Picard group. The Neron-Severi lattice $NS(S)$ is the subgroup of $Pic(S)$ generated by the divisor classes of the curve $S$. Our main goal is to classify the K3 surfaces $S$ such that $NS(S)$ is isomorphic to a lattice of the form $U \times E_8$, where $U$ is a hyperbolic plane and $E_8$ is the standard $E_8$ lattice. We achieve this by first showing that such a surface must have a certain type of singular point, and then using this information to compute the invariants of $NS(S)$. Our classification result has implications for the study of arithmetic geometry, and provides a new perspective on the geometry of K3 surfaces.'
  • 'arXiv:2203.01023v1 [math.RT] 1 Mar 2022\n\nTitle: On the representation theory of the Iwahori-Hecke algebra of the symmetric group\n\nAbstract: We study the representation theory of the Iwahori-Hecke algebra of the symmetric group. Our main result is a classification of the irreducible representations of this algebra in terms of the representation theory of the symmetric group. We also provide a new proof of the fact that the Iwahori-Hecke algebra is a semisimple algebra. Our methods involve a combination of representation theory, algebraic geometry, and combinatorics.\n\n1 Introduction\n\nThe Iwahori-Hecke algebra of the symmetric group is a well-studied algebra that has connections to many areas of mathematics, including representation theory, algebraic geometry, and combinatorics. In this paper, we study the representation theory of this algebra, with a focus on classifying the irreducible representations. Our main result is a classification of the irreducible representations of the Iwahori-Hecke algebra in terms of the representation theory of the symmetric group. We also provide a new proof of the fact that the Iwahori-Hecke algebra is a semisimple algebra. Our methods involve a combination of representation theory, algebraic geometry, and combinatorics.\n\n2 Background\n\nIn this section, we provide some background on the representation theory of the symmetric group and the Iwahori-Hecke algebra. We recall the definition of the Iwahori-Hecke algebra and its properties, and we also recall some results on the representation theory of the symmetric group.\n\n3 The Representation Theory of the Iwahori-Hecke Algebra\n\nIn this section, we study the representation theory of the Iwahori-Hecke algebra. We provide a classification of the irreducible representations of this algebra in terms of the representation theory of the symmetric group. We also provide a new proof of the fact that the Iwahori-Hecke algebra is a semisimple algebra.\n\n4 Conclusion\n\nIn this paper, we have studied the representation theory of the Iwahori-Hecke algebra of the symmetric group. Our main result is a classification of the irreducible representations of this algebra in terms of the representation theory of the symmetric group. We have also provided a new proof of the fact that the Iwahori-Hecke algebra is a semisimple algebra. We believe that our results will have applications in many areas of mathematics, including representation theory, algebraic geometry, and combinatorics.\n\nReferences\n\n[1] Ariki, S. (1996). On the decomposition numbers of the Hecke algebra of the symmetric group. Journal of Algebra, 183(2), 371-394.\n[2] Ariki, S., & Koike, K. (1994). A Hecke algebra of (Z/rZ)Sn and approximation of the irreducible characters of the symmetric group. Journal of Algebra, 171(2), 311-346.\n[3] Dipper, R., & James, G. (1988). Representations of the symmetric group which are irreducible over the commutator subgroup. Mathematische Zeitschrift, 198(2), 151-166.\n[4] Green, J. A. (1955). Axiomatic approach to the representation theory of the symmetric group. Journal of Algebra, 1(2), 107-133.\n[5] James, G. (1978). The representation theory of the symmetric group. Lecture Notes in Mathematics, 682, 1-42.\n[6] Murphy, G. J. (1990). On the representation theory of the symmetric group. Journal of Algebra, 131(2), 449-465.\n[7] Nakayama, T. (1952). On the representations of the symmetric group. Journal of the Faculty of Science, University of Tokyo, 6(2), 147-172.\n[8] Sagan, B. E. (1991). The symmetric group: Representations, combinatorial algorithms, and symmetric functions. Wadsworth & Brooks/Cole.\n[9] Zelevinsky, A. (1980). Representations of the symmetric group which are irreducible over the commutator subgroup. Mathematische Zeitschrift, 173(2), 133-146.\n[10] Zelevinsky, A. (1981). Representations of the symmetric group which are irreducible over the commutator subgroup. Journal of Algebra, 71(2), 249-262.\n\n\n'
Systems and Control
  • 'Optimal Control Theory for Nonlinear Systems with Bounded Controls\n\nAbstract: This paper presents a new approach to optimal control theory for nonlinear systems with bounded controls. We propose a novel method for solving the Hamilton-Jacobi-Isaacs equation, which is a fundamental equation in optimal control theory. Our approach is based on a combination of deep learning and numerical methods, and it is capable of handling high-dimensional systems with nonlinear dynamics. We demonstrate the effectiveness of our method through numerical experiments on several benchmark problems, including a nonlinear pendulum and a nonlinear cart-pole system. Our results show that our method can achieve better performance than existing methods, and it is computationally efficient. We also provide a theoretical analysis of our method, and we show that it converges to the optimal solution under certain conditions. The proposed method has the potential to be applied to a wide range of fields, including robotics, aerospace engineering, and biomedical engineering.'
  • 'Title: A Robust Control Approach for Uncertain Systems with Time-Varying Delays\nAbstract: This paper presents a robust control strategy for uncertain systems with time-varying delays. The proposed method combines a model predictive control (MPC) scheme with a robust control approach to ensure stability and performance of the closed-loop system. The MPC scheme is designed to handle the time-varying delays, while the robust control approach ensures that the system remains stable despite the presence of uncertainties. The effectiveness of the proposed method is demonstrated through numerical simulations and experimental results on a laboratory setup. The results show that the proposed method outperforms traditional robust control approaches in terms of stability and performance. The proposed method has the potential to be applied to a wide range of uncertain systems with time-varying delays, such as those encountered in robotics, aerospace, and process control.'
  • 'H-infinity Control in Robust Control\n\nAbstract: This paper presents a novel approach to robust control design using H-infinity control theory. The proposed method combines the advantages of both H-infinity and mu-synthesis techniques to achieve improved robustness and performance. Theoretical results are supported by numerical examples and comparisons with existing methods. The proposed approach is demonstrated on a benchmark problem and shows significant improvements in terms of robust stability and performance.\n\nIntroduction\n\nH-infinity control theory has been widely used in robust control design due to its ability to provide guaranteed robust stability and performance. However, the existing methods often suffer from conservativeness and computational complexity. In this paper, we propose a novel approach that combines the advantages of H-infinity and mu-synthesis techniques to achieve improved robustness and performance. The proposed method is based on a new optimization problem formulation that takes into account the uncertainties and disturbances in the system.\n\nMethodology\n\nThe proposed approach is based on the following steps: (1) model the system using a state-space representation; (2) formulate the H-infinity control problem using the mu-synthesis framework; (3) solve the optimization problem to obtain the controller gains; and (4) implement the controller using a digital signal processor. The proposed approach is demonstrated on a benchmark problem, which is a two-input two-output system with uncertainties in the plant and disturbances in the input.\n\nResults\n\nThe proposed approach is compared with existing methods, including H-infinity control and mu-synthesis. The results show that the proposed approach achieves improved robust stability and performance, with a significant reduction in the control effort. The proposed approach is also compared with a state-of-the-art robust control method, which shows that the proposed approach outperforms the existing method in terms of robust stability and performance.\n\nConclusion\n\nIn this paper, we have presented a novel approach to robust control design using H-infinity control theory. The proposed method combines the advantages of both H-infinity and mu-synthesis techniques to achieve improved robustness and performance. Theoretical results are supported by numerical examples and comparisons with existing methods. The proposed approach is demonstrated on a benchmark problem and shows significant improvements in terms of robust stability and performance.'
Statistics Theory
  • 'Title: A New Perspective on the Generalization Error of Support Vector Machines\nAbstract: We provide a new bound on the generalization error of support vector machines (SVMs) in terms of the Rademacher complexity of the reproducing kernel Hilbert space (RKHS) of the kernel. Our bound is tighter than existing bounds and has a simpler form. We also provide a new algorithm for learning the kernel, which is based on the idea of minimizing the empirical risk with respect to the RKHS norm. We demonstrate the effectiveness of our approach on several benchmark datasets.'
  • 'Title: A Bayesian Approach to Hypothesis Testing for High-Dimensional Data\n\nAbstract: Hypothesis testing is a fundamental problem in statistics, and its applications are widespread in various fields. However, the traditional methods of hypothesis testing often fail to perform well in high-dimensional data settings. In this paper, we propose a novel Bayesian approach to hypothesis testing for high-dimensional data. Our method combines the strengths of Bayesian inference and dimensionality reduction techniques to provide a robust and efficient solution to the hypothesis testing problem. We demonstrate the effectiveness of our approach through extensive simulations and real-world experiments on high-dimensional data sets. The results show that our method outperforms existing methods in terms of accuracy and computational efficiency. Furthermore, we provide a theoretical analysis of our approach, which provides insights into its performance and limitations. Our method has the potential to be applied to a wide range of applications, including image analysis, genomics, and finance. The code and data used in this paper are available online for reproducibility purposes.'
  • 'Title: Bayesian Network Learning with Gaussian Process Priors for Uncertainty Quantification in High-Dimensional Systems\n\nAbstract: Bayesian networks are a powerful tool for modeling complex systems with uncertainty. However, in high-dimensional systems, the computational cost of learning Bayesian networks can be prohibitively expensive. In this paper, we propose a novel approach to Bayesian network learning using Gaussian process priors. Our approach, which we call Bayesian network learning with Gaussian process priors (BN-GP), leverages the flexibility of Gaussian processes to model the uncertainty in the network structure. We demonstrate the effectiveness of BN-GP on several high-dimensional systems, including a synthetic dataset and a real-world dataset from the field of systems biology. Our results show that BN-GP can learn accurate Bayesian networks with significantly reduced computational cost compared to traditional methods. Furthermore, we provide a theoretical analysis of the convergence properties of BN-GP, which shows that it can learn consistent estimates of the network structure even in the presence of high-dimensional data. Our approach has the potential to enable the widespread adoption of Bayesian networks in high-dimensional systems, where traditional methods are often infeasible.\n\nKeywords: Bayesian networks, Gaussian process priors, uncertainty quantification, high-dimensional systems, systems biology.'
Artificial Intelligence
  • 'Title: Learning Hierarchical Representations for Robust Visual Perception in Autonomous Systems\nAbstract: We propose a novel deep learning approach for visual perception in autonomous systems, which leverages hierarchical representations to improve robustness and accuracy. Our method combines a convolutional neural network (CNN) with a recurrent neural network (RNN) to learn a hierarchical representation of visual data. We evaluate our approach on several benchmark datasets and demonstrate significant improvements in performance compared to state-of-the-art methods. Our results show that the proposed approach can learn robust and accurate representations of visual data, even in the presence of significant occlusions and variations in lighting conditions. We also provide a detailed analysis of the learned representations and demonstrate their applicability to various tasks in autonomous systems. This work makes significant contributions to the field of computer vision and robotics, and has the potential to enable more robust and accurate visual perception in autonomous systems.'
  • 'Title: A Deep Learning Approach for Text Classification: A Comparative Study\n\nAbstract: Text classification is a fundamental task in natural language processing (NLP) that has numerous applications in various domains. In this paper, we propose a deep learning approach for text classification using convolutional neural networks (CNNs) and recurrent neural networks (RNNs). We compare the performance of our proposed approach with state-of-the-art methods on several benchmark datasets. Our results show that our approach outperforms the existing methods in terms of accuracy and F1-score. We also analyze the effect of different hyperparameters on the performance of our approach and provide insights into the importance of feature extraction in text classification. This study contributes to the development of efficient and accurate text classification models using deep learning techniques.\n\nKeywords: text classification, deep learning, convolutional neural networks, recurrent neural networks, natural language processing.'
  • 'Title: Investigating the Impact of Attention Mechanisms on Deep Learning Models for Sentiment Analysis.\n\nAbstract: This paper explores the effects of incorporating attention mechanisms into deep learning models for sentiment analysis. We propose a novel architecture that combines the strengths of recurrent neural networks (RNNs) and attention mechanisms to improve the performance of sentiment analysis tasks. Our experimental results demonstrate that the proposed model outperforms state-of-the-art models in terms of accuracy and F1-score. Furthermore, we conduct an ablation study to investigate the impact of different attention mechanisms on the performance of the model. Our findings suggest that the proposed attention mechanism is more effective than other attention mechanisms in improving the performance of sentiment analysis tasks.\n\nKeywords: deep learning, attention mechanisms, sentiment analysis, natural language processing, neural networks.\n\nSource: Google Scholar.'
Computational Engineering
  • 'Title: An Efficient Seismic Inversion Method Using Deep Learning for Reservoir Characterization\nAbstract: Seismic inversion is a crucial step in reservoir characterization, and its accuracy directly affects the economic viability of hydrocarbon exploration and production. In this paper, we propose a novel seismic inversion method based on deep learning that can efficiently handle large-scale seismic data. The proposed method utilizes a convolutional neural network (CNN) to learn the mapping between seismic data and reservoir properties. We demonstrate the effectiveness of our method using a real-world dataset and show that it outperforms traditional methods in terms of accuracy and computational efficiency. Our results indicate that the proposed method can be a valuable tool for seismic inversion and reservoir characterization.\nKeywords: seismic inversion, deep learning, reservoir characterization, convolutional neural network, computational engineering.'
  • "A Novel Finite Element Method for Nonlinear Structural Analysis of Composite Materials\nAbstract: This paper presents a novel finite element method for nonlinear structural analysis of composite materials. The proposed method is based on a combination of the extended finite element method (XFEM) and the peridynamic theory (PDT). The XFEM is used to model the nonlinear behavior of the composite material, while the PDT is used to capture the long-range interactions between the material's particles. The proposed method is implemented in a computational framework and is validated using several numerical examples. The results show that the proposed method can accurately capture the nonlinear behavior of composite materials and can be used to predict the structural response of complex composite structures. The proposed method has the potential to be used in various engineering applications, including the design and analysis of composite structures for aerospace, automotive, and civil engineering."
  • 'Title: A Novel Finite Element Method for Simulating Nonlinear Dynamics in Composite Materials\n\nAbstract: This paper presents a new finite element method for simulating nonlinear dynamics in composite materials. The proposed method combines the advantages of the partition of unity method and the extended finite element method to capture the complex behavior of composite materials under various loading conditions. The numerical results show that the proposed method can accurately predict the nonlinear dynamics of composite materials, including the effects of material nonlinearity and geometric nonlinearity. The proposed method is also compared with other existing methods, and the results show that it has better accuracy and efficiency.\n\nKeywords: finite element method, nonlinear dynamics, composite materials, partition of unity method, extended finite element method.\n\nArXiv ID: 2203.03045\n\nSubmission date: 2022-03-07\n\n'
Computer Vision
  • 'A Novel Approach to Object Detection using Convolutional Neural Networks\n\nAbstract: Object detection is a fundamental task in computer vision, and its applications are vast in various fields. In this paper, we propose a novel approach to object detection using convolutional neural networks (CNNs). Our method, called Object Detection using CNNs (ODCNN), is based on a combination of region proposal networks (RPNs) and CNNs. We train the ODCNN model on the PASCAL VOC 2007 dataset and evaluate its performance on the PASCAL VOC 2012 dataset. The results show that our approach outperforms the state-of-the-art methods in terms of accuracy and speed. We also provide a detailed analysis of the ODCNN model and its components. The code for the ODCNN model is available at https://github.com/odcnn/odcnn.\n\nKeywords: Object detection, Convolutional neural networks, Region proposal networks, PASCAL VOC 2007, PASCAL VOC 2012.'
  • 'A Novel Object Recognition Framework for Autonomous Robots using Deep Learning and Computer Vision Techniques\n\nAbstract: This paper proposes a novel object recognition framework for autonomous robots that leverages the power of deep learning and computer vision techniques. The proposed framework consists of two stages: a detection stage and a recognition stage. In the detection stage, a convolutional neural network (CNN) is used to detect objects in the scene, while in the recognition stage, a recurrent neural network (RNN) is employed to recognize the detected objects. The proposed framework is evaluated on a dataset of images collected from a robotic platform, and the results show that it outperforms state-of-the-art methods in terms of accuracy and speed. The proposed framework has the potential to be used in various applications, including robotics, autonomous vehicles, and surveillance systems.\n\nKeywords: Object recognition, autonomous robots, deep learning, computer vision, convolutional neural networks, recurrent neural networks.\n\n'
  • 'A novel approach to image classification using convolutional neural networks (CNNs) is proposed in this paper. The proposed method, dubbed "Deep Image Classifier", leverages the power of CNNs to learn hierarchical features from images. Experimental results on several benchmark datasets, including CIFAR-10 and ImageNet, demonstrate the efficacy of the proposed method in achieving state-of-the-art performance. The code for the proposed method is made available on GitHub, allowing for easy reproduction and extension of the results. The contributions of this paper can be summarized as follows: (1) a novel CNN architecture is proposed, which consists of multiple convolutional and pooling layers, followed by fully connected layers; (2) a novel training strategy is proposed, which involves data augmentation and batch normalization; (3) the proposed method is evaluated on several benchmark datasets, and the results are compared with state-of-the-art methods. The results of this paper demonstrate the potential of CNNs in image classification tasks, and provide a new benchmark for future research in this area.'

Evaluation

Metrics

Label F1
all 0.6184

Uses

Direct Use for Inference

First install the SetFit library:

pip install setfit

Then you can load this model and run inference.

from setfit import SetFitModel

# Download from the 🤗 Hub
model = SetFitModel.from_pretrained("setfit_model_id")
# Run inference
preds = model("Let G be a finite group and let V be a finite-dimensional representation of G over an algebraically closed field k. We say that V is a representation of G in characteristic zero if the characteristic of k is zero. In this paper, we investigate the structure of the representation ring R(G) of a finite group G in characteristic zero. We show that R(G) is isomorphic to the group ring k[G] if and only if G is a cyclic group. Furthermore, we provide a characterization of the representation rings of finite abelian groups in terms of their irreducible representations. Our results have implications for the study of the representation theory of finite groups in characteristic zero.")

Training Details

Training Set Metrics

Training set Min Median Max
Word count 69 220.7380 1079
Label Training Sample Count
Commutative Algebra 15
Computer Vision 12
Artificial Intelligence 16
Systems and Control 19
Group Theory 21
Computational Engineering 16
Programming Languages 13
Information Theory 21
Data Structures 21
Neural and Evolutionary 21
Statistics Theory 12

Training Hyperparameters

  • batch_size: (16, 16)
  • num_epochs: (5, 5)
  • max_steps: -1
  • sampling_strategy: oversampling
  • body_learning_rate: (2e-05, 1e-05)
  • head_learning_rate: 0.01
  • loss: CosineSimilarityLoss
  • distance_metric: cosine_distance
  • margin: 0.25
  • end_to_end: False
  • use_amp: False
  • warmup_proportion: 0.1
  • seed: 42
  • eval_max_steps: -1
  • load_best_model_at_end: True

Training Results

Epoch Step Training Loss Validation Loss
0.0005 1 0.158 -
0.0253 50 0.1482 -
0.0505 100 0.1408 -
0.0758 150 0.1071 -
0.1011 200 0.1294 -
0.1263 250 0.0782 -
0.1516 300 0.0628 -
0.1769 350 0.0909 -
0.2021 400 0.0161 -
0.2274 450 0.0068 -
0.2527 500 0.011 -
0.2779 550 0.0027 -
0.3032 600 0.0018 -
0.3284 650 0.0011 -
0.3537 700 0.0037 -
0.3790 750 0.0015 -
0.4042 800 0.0012 -
0.4295 850 0.0006 -
0.4548 900 0.0013 -
0.4800 950 0.0004 -
0.5053 1000 0.0003 -
0.5306 1050 0.0001 -
0.5558 1100 0.0007 -
0.5811 1150 0.0001 -
0.6064 1200 0.0004 -
0.6316 1250 0.0001 -
0.6569 1300 0.0001 -
0.6822 1350 0.0006 -
0.7074 1400 0.0002 -
0.7327 1450 0.0002 -
0.7580 1500 0.0001 -
0.7832 1550 0.0001 -
0.8085 1600 0.0001 -
0.8338 1650 0.0001 -
0.8590 1700 0.0002 -
0.8843 1750 0.0001 -
0.9096 1800 0.0001 -
0.9348 1850 0.0001 -
0.9601 1900 0.0001 -
0.9853 1950 0.0001 -
1.0 1979 - 0.0359
1.0106 2000 0.0001 -
1.0359 2050 0.0001 -
1.0611 2100 0.0002 -
1.0864 2150 0.0001 -
1.1117 2200 0.0002 -
1.1369 2250 0.0001 -
1.1622 2300 0.0 -
1.1875 2350 0.0003 -
1.2127 2400 0.0001 -
1.2380 2450 0.0001 -
1.2633 2500 0.0001 -
1.2885 2550 0.0 -
1.3138 2600 0.0 -
1.3391 2650 0.0001 -
1.3643 2700 0.0046 -
1.3896 2750 0.0044 -
1.4149 2800 0.0005 -
1.4401 2850 0.0002 -
1.4654 2900 0.0001 -
1.4907 2950 0.0 -
1.5159 3000 0.0001 -
1.5412 3050 0.0001 -
1.5664 3100 0.0001 -
1.5917 3150 0.0001 -
1.6170 3200 0.0 -
1.6422 3250 0.0 -
1.6675 3300 0.0 -
1.6928 3350 0.0 -
1.7180 3400 0.0001 -
1.7433 3450 0.0 -
1.7686 3500 0.0 -
1.7938 3550 0.0001 -
1.8191 3600 0.0 -
1.8444 3650 0.0 -
1.8696 3700 0.0 -
1.8949 3750 0.0 -
1.9202 3800 0.0 -
1.9454 3850 0.0 -
1.9707 3900 0.0 -
1.9960 3950 0.0 -
2.0 3958 - 0.0579
2.0212 4000 0.0 -
2.0465 4050 0.0 -
2.0718 4100 0.0001 -
2.0970 4150 0.0001 -
2.1223 4200 0.0 -
2.1475 4250 0.0 -
2.1728 4300 0.0 -
2.1981 4350 0.0 -
2.2233 4400 0.0 -
2.2486 4450 0.0 -
2.2739 4500 0.0 -
2.2991 4550 0.0 -
2.3244 4600 0.0001 -
2.3497 4650 0.0 -
2.3749 4700 0.0001 -
2.4002 4750 0.0 -
2.4255 4800 0.0 -
2.4507 4850 0.0001 -
2.4760 4900 0.0 -
2.5013 4950 0.0 -
2.5265 5000 0.0 -
2.5518 5050 0.0 -
2.5771 5100 0.0 -
2.6023 5150 0.0 -
2.6276 5200 0.0 -
2.6529 5250 0.0 -
2.6781 5300 0.0 -
2.7034 5350 0.0001 -
2.7287 5400 0.0 -
2.7539 5450 0.0 -
2.7792 5500 0.0001 -
2.8044 5550 0.0 -
2.8297 5600 0.0 -
2.8550 5650 0.0 -
2.8802 5700 0.0 -
2.9055 5750 0.0 -
2.9308 5800 0.0 -
2.9560 5850 0.0 -
2.9813 5900 0.0 -
3.0 5937 - 0.0557
3.0066 5950 0.0 -
3.0318 6000 0.0 -
3.0571 6050 0.0 -
3.0824 6100 0.0 -
3.1076 6150 0.0 -
3.1329 6200 0.0 -
3.1582 6250 0.0 -
3.1834 6300 0.0 -
3.2087 6350 0.0 -
3.2340 6400 0.0 -
3.2592 6450 0.0 -
3.2845 6500 0.0 -
3.3098 6550 0.0 -
3.3350 6600 0.0 -
3.3603 6650 0.0 -
3.3855 6700 0.0 -
3.4108 6750 0.0 -
3.4361 6800 0.0 -
3.4613 6850 0.0 -
3.4866 6900 0.0 -
3.5119 6950 0.0 -
3.5371 7000 0.0 -
3.5624 7050 0.0 -
3.5877 7100 0.0 -
3.6129 7150 0.0 -
3.6382 7200 0.0 -
3.6635 7250 0.0 -
3.6887 7300 0.0 -
3.7140 7350 0.0 -
3.7393 7400 0.0 -
3.7645 7450 0.0 -
3.7898 7500 0.0 -
3.8151 7550 0.0 -
3.8403 7600 0.0 -
3.8656 7650 0.0 -
3.8909 7700 0.0 -
3.9161 7750 0.0 -
3.9414 7800 0.0 -
3.9666 7850 0.0 -
3.9919 7900 0.0 -
4.0 7916 - 0.0543
4.0172 7950 0.0 -
4.0424 8000 0.0 -
4.0677 8050 0.0 -
4.0930 8100 0.0 -
4.1182 8150 0.0 -
4.1435 8200 0.0 -
4.1688 8250 0.0 -
4.1940 8300 0.0 -
4.2193 8350 0.0 -
4.2446 8400 0.0 -
4.2698 8450 0.0 -
4.2951 8500 0.0 -
4.3204 8550 0.0 -
4.3456 8600 0.0 -
4.3709 8650 0.0 -
4.3962 8700 0.0 -
4.4214 8750 0.0 -
4.4467 8800 0.0 -
4.4720 8850 0.0 -
4.4972 8900 0.0 -
4.5225 8950 0.0 -
4.5478 9000 0.0 -
4.5730 9050 0.0 -
4.5983 9100 0.0 -
4.6235 9150 0.0 -
4.6488 9200 0.0 -
4.6741 9250 0.0 -
4.6993 9300 0.0 -
4.7246 9350 0.0 -
4.7499 9400 0.0 -
4.7751 9450 0.0 -
4.8004 9500 0.0 -
4.8257 9550 0.0 -
4.8509 9600 0.0 -
4.8762 9650 0.0 -
4.9015 9700 0.0 -
4.9267 9750 0.0 -
4.9520 9800 0.0 -
4.9773 9850 0.0 -
5.0 9895 - 0.0537
  • The bold row denotes the saved checkpoint.

Framework Versions

  • Python: 3.9.19
  • SetFit: 1.1.0.dev0
  • Sentence Transformers: 3.0.1
  • Transformers: 4.39.0
  • PyTorch: 2.4.0
  • Datasets: 2.20.0
  • Tokenizers: 0.15.2

Citation

BibTeX

@article{https://doi.org/10.48550/arxiv.2209.11055,
    doi = {10.48550/ARXIV.2209.11055},
    url = {https://arxiv.org/abs/2209.11055},
    author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
    keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
    title = {Efficient Few-Shot Learning Without Prompts},
    publisher = {arXiv},
    year = {2022},
    copyright = {Creative Commons Attribution 4.0 International}
}