diff --git "a/README.md" "b/README.md" new file mode 100644--- /dev/null +++ "b/README.md" @@ -0,0 +1,2428 @@ +--- +base_model: Alibaba-NLP/gte-base-en-v1.5 +datasets: +- davanstrien/query-to-dataset-viewer-descriptions +language: +- en +library_name: sentence-transformers +license: apache-2.0 +metrics: +- cosine_accuracy +- dot_accuracy +- manhattan_accuracy +- euclidean_accuracy +- max_accuracy +pipeline_tag: sentence-similarity +tags: +- sentence-transformers +- sentence-similarity +- feature-extraction +- generated_from_trainer +- dataset_size:1141 +- loss:CachedMultipleNegativesRankingLoss +widget: +- source_sentence: 'USER_QUERY: mental health conversation dataset' + sentences: + - "NEGATIVE: DATASET_NAME: \"llmModeluser/Therapy_sessions_dataset\"\nFEATURES:\ + \ {'conversations': {'dtype': 'string', '_type': 'Value'}, 'id': {'dtype': 'string',\ + \ '_type': 'Value'}}\nDATA SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\":\ + \ {\n \"conversations\": \"[{'from': 'human', 'value': \\\"I've been feeling\ + \ so sad and overwhelmed lately. Work has become such a massive source of stress\ + \ for me.\\\"}\\n {'from': 'gpt', 'value': \\\"Hey there, I'm here to listen and\ + \ support you. It sounds like work has been really challenging lately. Can you\ + \ tell me more about what's been going on?\\\"}\\n {'from': 'human', 'value':\ + \ \\\"I recently got a promotion at work, which I thought would be exciting. But\ + \ the added responsibilities and pressure have just taken a toll on my mental\ + \ health. It's been a really moving experience for me.\\\"}\\n {'from': 'gpt',\ + \ 'value': \\\"I can understand how it can be overwhelming when we're faced with\ + \ higher expectations. It's okay to acknowledge your emotions and allow yourself\ + \ to feel sad in this situation. It's an important part of the healing process.\ + \ What specific challenges have you been facing at work?\\\"}\\n {'from': 'human',\ + \ 'value': \\\"Well, the workload has increased significantly, and I find it hard\ + \ to maintain a work-life balance. I've been staying late at the office, and it\ + \ feels like I'm constantly under a pile of never-ending tasks. It's just so hard\ + \ to keep up, and it's impacting my overall well-being.\\\"}\\n {'from': 'gpt',\ + \ 'value': \\\"It sounds like you're dealing with a lot of pressure to perform\ + \ and succeed. Remember, it's crucial to take care of yourself, both mentally\ + \ and physically. A healthy work-life balance is essential. Have you tried any\ + \ strategies to cope with the added workload?\\\"}\\n {'from': 'human', 'value':\ + \ \\\"I've been trying to prioritize my tasks and delegate whenever possible.\ + \ I've also started practicing meditation during my breaks to help manage stress.\ + \ But sometimes, it feels like no matter what I do, I can't catch a break. It's\ + \ been a constant struggle.\\\"}\\n {'from': 'gpt', 'value': \\\"It's great to\ + \ hear that you're already implementing some helpful strategies. Remember, progress\ + \ takes time, and it's okay to have setbacks. In addition to what you're already\ + \ doing, I encourage you to also communicate with your supervisor or team about\ + \ your workload and discuss possible solutions together.\\\"}\\n {'from': 'human',\ + \ 'value': \\\"You're right. I haven't really opened up about my struggles to\ + \ my coworkers or supervisor. I guess I've been afraid of appearing weak or incapable.\ + \ How can I approach this discussion without feeling vulnerable?\\\"}\\n {'from':\ + \ 'gpt', 'value': \\\"It's completely normal to feel that way, but remember, asking\ + \ for support is a strength, not a weakness. Start by scheduling a conversation\ + \ with your supervisor or a trusted colleague in a private and comfortable setting.\ + \ Be honest about your challenges and express your willingness to find solutions\ + \ together. Remember, you're not alone in this.\\\"}\\n {'from': 'human', 'value':\ + \ \\\"Thank you for your understanding and guidance. I appreciate the reminder\ + \ that I don't have to face this alone. I'll gather my courage and initiate that\ + \ conversation soon. I need to prioritize my well-being and find a healthier balance.\\\ + \"}\\n {'from': 'gpt', 'value': \\\"You're very welcome! I'm here to support you\ + \ every step of the way. Taking care of yourself should always be a priority.\ + \ Remember to be kind to yourself and celebrate your progress, no matter how small\ + \ it may seem. You've got this!\\\"}]\",\n \"id\": \"identity_0\"\n },\n\ + \ \"truncated_cells\": []\n },\n {\n \"row_idx\": 1,\n \"row\": {\n\ + \ \"conversations\": \"[{'from': 'human', 'value': \\\"Hi, I'm feeling really\ + \ scared about something and I need some advice. It's about friendship.\\\"}\\\ + n {'from': 'gpt', 'value': \\\"Hello there! I'm here to listen and offer support.\ + \ Tell me more about what's going on with your friendship. What specifically is\ + \ making you feel scared?\\\"}\\n {'from': 'human', 'value': \\\"Well, lately,\ + \ I've been feeling like my friends don't value me as much as I value them. It's\ + \ like I'm always the one reaching out and making plans, but they never initiate\ + \ anything. It's making me doubt if our friendship is genuine.\\\"}\\n {'from':\ + \ 'gpt', 'value': \\\"I understand how disheartening that can be. It's important\ + \ to feel valued and appreciated in any relationship. Have you talked to your\ + \ friends about how you're feeling?\\\"}\\n {'from': 'human', 'value': \\\"No,\ + \ I haven't. I'm afraid that they'll think I'm being needy or that they'll get\ + \ defensive. I don't want to ruin our friendship by bringing it up.\\\"}\\n {'from':\ + \ 'gpt', 'value': \\\"It's normal to feel hesitant about having difficult conversations,\ + \ but addressing your concerns is a crucial step towards building stronger connections.\ + \ Before jumping to conclusions, it's important to communicate openly and honestly.\ + \ You deserve to express your feelings.\\\"}\\n {'from': 'human', 'value': \\\"\ + You're right, I should talk to them. But how do I approach the conversation without\ + \ sounding accusatory or angry?\\\"}\\n {'from': 'gpt', 'value': 'That\\\\'s a\ + \ great question! It can be helpful to use \\\"I\\\" statements to express how\ + \ you feel, rather than placing blame on others. For example, you can say, \\\"\ + I\\\\'ve been feeling a bit overlooked lately, and I wanted to talk about our\ + \ friendship to ensure we\\\\'re on the same page.\\\"'}\\n {'from': 'human',\ + \ 'value': \\\"I see, so it's important to frame it as a dialogue rather than\ + \ confrontational. That makes sense.\\\"}\\n {'from': 'gpt', 'value': 'Absolutely!\ + \ Opening up a conversation can lead to better understanding and growth. Remember,\ + \ your feelings are valid, and a healthy friendship should provide a safe space\ + \ to express them. If your friends truly care about you, they will listen and\ + \ try to understand your perspective.'}\\n {'from': 'human', 'value': \\\"It's\ + \ reassuring to hear that. I'll try to gather the courage to talk to them. But\ + \ what if they don't respond well? What if they dismiss my concerns?\\\"}\\n {'from':\ + \ 'gpt', 'value': \\\"It's natural to worry about how others might react, but\ + \ remember, you cannot control their responses. What you can control is how you\ + \ handle the situation. If they dismiss your concerns, it might be an indication\ + \ that they are not as invested in the friendship as you would like. In that case,\ + \ it's essential to focus on surrounding yourself with people who appreciate and\ + \ support you.\\\"}\\n {'from': 'human', 'value': \\\"That's true. I shouldn't\ + \ hold onto toxic relationships. It's just hard to let go sometimes, you know?\\\ + \"}\\n {'from': 'gpt', 'value': \\\"Letting go can be challenging, especially\ + \ when we have invested time and energy into a friendship. It's essential to prioritize\ + \ your well-being and surround yourself with those who uplift you. Remember, you\ + \ deserve friendships that nourish your self-esteem and bring positivity into\ + \ your life.\\\"}\\n {'from': 'human', 'value': 'Thank you for reminding me of\ + \ my worth. I appreciate your advice and support.'}\\n {'from': 'gpt', 'value':\ + \ \\\"You're very welcome! Building healthy friendships is a journey, and you\ + \ have taken the first step by recognizing the need for change. Remember to be\ + \ gentle with yourself as you navigate through this process. You're doing great!\\\ + \"}\\n {'from': 'human', 'value': 'I will, thank you. I feel much better after\ + \ talking to you.'}\\n {'from': 'gpt', 'value': \\\"I'm glad to hear that. Remember,\ + \ I'm here for you whenever you need someone to talk to. Don't hesitate to reach\ + \ out. You've got this!\\\"}]\",\n \"id\": \"identity_1\"\n },\n \"\ + truncated_cells\": []\n }\n]" + - "HUB_DATASET_PREVIEW: DATASET_NAME: \"emozilla/dolma-v1_7-arxiv\"\nFEATURES: {'text':\ + \ {'dtype': 'string', '_type': 'Value'}, 'id': {'dtype': 'string', '_type': 'Value'},\ + \ 'metadata': {'file_path': {'dtype': 'string', '_type': 'Value'}}}\nDATA SAMPLE:\n\ + [\n {\n \"row_idx\": 0,\n \"row\": {\n \"text\": \"\\\\section{Introduction}\\\ + nLet $G$ be a simple undirected graph with the \\\\textit{vertex set} $V(G)$ and\ + \ the \\\\textit{edge set} $E(G)$. A vertex with degree one is called a \\\\textit{pendant\ + \ vertex}. The distance between the vertices $u$ and $v$ in graph $G$ is denoted\ + \ by $d_G(u,v)$. A cycle $C$ is called \\\\textit{chordless} if $C$ has no \\\\\ + textit{cycle chord} (that is an edge not in the edge set of $C$ whose endpoints\ + \ lie on the vertices of $C$).\\nThe \\\\textit{Induced subgraph} on vertex set\ + \ $S$ is denoted by $\\\\langle S\\\\rangle$. A path that starts in $v$ and ends\ + \ in $u$ is denoted by $\\\\stackrel\\\\frown{v u}$.\\nA \\\\textit{traceable}\ + \ graph is a graph that possesses a Hamiltonian path.\\nIn a graph $G$, we say\ + \ that a cycle $C$ is \\\\textit{formed by the path} $Q$ if $ | E(C) \\\\setminus\ + \ E(Q) | = 1 $. So every vertex of $C$ belongs to $V(Q)$.\\n\\nIn 2011 the following\ + \ conjecture was proposed:\\n\\\\begin{conjecture}(Hoffmann-Ostenhof \\\\cite{hoffman})\\\ + nLet $G$ be a connected cubic graph. Then $G$ has a decomposition into a spanning\ + \ tree, a matching and a family of cycles.\\n\\n\\\\end{conjecture}\\nConjecture\ + \ \\\\theconjecture$\\\\,$ also appears in Problem 516 \\\\cite{cameron}. There\ + \ are a few partial results known for Conjecture \\\\theconjecture. Kostochka\ + \ \\\\cite{kostocha} noticed that the Petersen graph, the prisms over cycles,\ + \ and many other graphs have a decomposition desired in Conjecture \\\\theconjecture.\ + \ Ozeki and Ye \\\\cite{ozeki} proved that the conjecture holds for 3-connected\ + \ cubic plane graphs. Furthermore, it was proved by Bachstein \\\\cite{bachstein}\ + \ that Conjecture \\\\theconjecture$\\\\,$ is true for every 3-connected cubic\ + \ graph embedded in torus or Klein-bottle. Akbari, Jensen and Siggers \\\\cite[Theorem\ + \ 9]{akbari} showed that Conjecture \\\\theconjecture$\\\\,$ is true for Hamiltonian\ + \ cubic graphs.\\n\\nIn this paper, we show that Conjecture \\\\theconjecture$\\\ + \\,$ holds for traceable cubic graphs.\\n\\\\section{Results}\\nBefore proving\ + \ the main result, we need the following lemma.\\n\\\\begin{lemma}\\n\\\\label{lemma:1}\\\ + nLet $G$ be a cubic graph. Suppose that $V(G)$ can be partitioned into a tree\ + \ $T$ and finitely many cycles such that there is no edge between any pair of\ + \ cycles (not necessarily distinct cycles), and every pendant vertex of $T$ is\ + \ adjacent to at least one vertex of a cycle. Then, Conjecture \\\\theconjecture$\\\ + \\,$ holds for $G$.\\n\\\\end{lemma}\\n\\\\begin{proof}\\nBy assumption, every\ + \ vertex of each cycle in the partition is adjacent to exactly one vertex of $T$.\ + \ Call the set of all edges with one endpoint in a cycle and another endpoint\ + \ in $T$ by $Q$.\\nClearly, the induced subgraph on $E(T) \\\\cup Q$ is a spanning\ + \ tree of $G$. We call it $T'$. Note that every edge between a pendant vertex\ + \ of $T$ and the union of cycles in the partition is also contained in $T'$. Thus,\ + \ every pendant vertex of $T'$ is contained in a cycle of the partition. Now,\ + \ consider the graph $H = G \\\\setminus E(T')$. For every $v \\\\in V(T)$, $d_H(v)\ + \ \\\\leq 1$. So Conjecture \\\\theconjecture$\\\\,$ holds for $G$. \\\\vspace{1em}\\\ + n\\\\end{proof}\\n\\n\\n\\\\noindent\\\\textbf{Remark 1.}\\n\\\\label{remark:1}\\\ + nLet $C$ be a cycle formed by the path $Q$. Then clearly there exists a chordless\ + \ cycle formed by $Q$.\\n\\nNow, we are in a position to prove the main result.\\\ + n\\n\\\\begin{theorem}\\nConjecture \\\\theconjecture$\\\\,$ holds for traceable\ + \ cubic graphs.\\n\\\\end{theorem}\\n\\\\begin{proof}\\nLet $G$ be a traceable\ + \ cubic graph and $P : v_1, \\\\dots, v_n$ be a Hamiltonian path in $G$. By \\\ + \\cite[Theorem 9]{akbari}, Conjecture A holds for $v_1 v_n \\\\in E(G)$. Thus\ + \ we can assume that $v_1 v_n \\\\notin E(G)$. Let $v_1 v_j, v_1 v_{j'}, v_i\ + \ v_n, v_{i'} v_n \\\\in E(G)\\\\setminus E(P)$ and $j' < j < n$, $1 < i < i'$.\ + \ Two cases can occur:\\n\\\\begin{enumerate}[leftmargin=0pt,label=]\\n\\\\item\\\ + n\\\\textbf{Case 1.}\\nAssume that $i < j$. Consider the following graph in Figure\ + \ \\\\ref{fig:overlapping} in which the thick edges denote the path $P$. Call\ + \ the three paths between $v_j$ and $v_i$, from the left to the right, by $P_1$,\ + \ $P_2$ and $P_3$, respectively (note that $P_1$ contains the edge $e'$ and $P_3$\ + \ contains the edge $e$).\\n\\n\\\\begin{figure}[H]\\n \\\\begin{center}\\n \ + \ \\\\includegraphics[width=40mm]{engImages/overlapping.pdf}\\n \\\\caption{Paths\ + \ $P_1$, $P_2$ and $P_3$}\\n \\\\label{fig:overlapping}\\n \\\\end{center}\\\ + n\\\\end{figure}\\n\\n\\nIf $P_2$ has order $2$, then $G$ is Hamiltonian and so\ + \ by \\\\cite[Theorem 9]{akbari} Conjecture \\\\theconjecture$\\\\,$ holds. Thus\ + \ we can assume that $P_1$, $P_2$ and $P_3$ have order at least $3$. Now, consider\ + \ the following subcases:\\\\\\\\\\n\\n\\\\begin{enumerate}[leftmargin=0pt,label=]\\\ + n\\\\label{case:1}\\n\\\\item \\\\textbf{Subcase 1.} There is no edge between\ + \ $V(P_r)$ and $V(P_s)$ for $1 \\\\leq r < s \\\\leq 3$. Since every vertex of\ + \ $P_i$ has degree 3 for every $i$, by \\\\hyperref[remark:1]{Remark 1}$\\\\,$\ + \ there are two chordless cycles $C_1$ and $C_2$ formed by $P_1$ and $P_2$, respectively.\\\ + nDefine a tree $T$ with the edge set\\n$$ E\\\\Big(\\\\langle V(G) \\\\setminus\ + \ \\\\big(V(C_1) \\\\cup V(C_2)\\\\big) \\\\rangle\\\\Big) \\\\bigcap \\\\big(\\\ + \\bigcup_{i=1}^3 E(P_i)\\\\big).$$\\nNow, apply \\\\hyperref[lemma:1]{Lemma 1}\ + \ $\\\\,$for the partition $\\\\{T, C_1, C_2\\\\}$.\\\\\\\\\\n\\n\\\\item \\\\\ + textbf{Subcase 2.}\\n\\\\label{case:edge}\\nThere exists at least one edge between\ + \ some $P_r$ and $P_s$, $r i'$ such that $v_s\ + \ v_t \\\\in E(G)$. By \\\\hyperref[remark:1]{Remark 1} $\\\\,$ there are two\ + \ chordless cycles $C_1$ and $C_2$, respectively formed by the paths $v_1 v_{j'}$\ + \ and $v_{i'} v_n$. By assumption there is no edge $xy$, where $x \\\\in V(C_1)$\ + \ and $y \\\\in V(C_2)$.\\nDefine a tree $T$ with the edge set:\\n$$ E\\\\Big(\\\ + \\langle V(G) \\\\setminus \\\\big(V(C_1) \\\\cup V(C_2)\\\\big) \\\\rangle \\\ + \\Big) \\\\bigcap \\\\Big( E(P) \\\\cup \\\\{v_{i'}v_n, v_{j'}v_1\\\\} \\\\Big).$$\\\ + nNow, apply \\\\hyperref[lemma:1]{Lemma 1} $\\\\,$for the partition $\\\\{T, C_1,\ + \ C_2\\\\}$.\\\\\\\\\\n\\n\\\\item \\\\textbf{Subcase 2.}\\n\\\\label{subcase:22}\ + \ There are at least four indices $s, s' < j$ and $t, t' > i$ such that $v_s v_t,\ + \ v_{s'} v_{t'} \\\\in E(G)$. Choose four indices $g, h < j$ and $e, f > i$ such\ + \ that $v_h v_e, v_g v_f \\\\in E(G)$ and $|g-h| + |e-f|$ is minimum.\\n\\n\\\\\ + begin{figure}[H]\\n \\\\begin{center}\\n \\\\includegraphics[width=90mm]{engImages/case2-subcase2.pdf}\\\ + n \\\\caption{Two edges $v_h v_e$ and $v_g v_f$}\\n \\\\label{fig:non-overlapping}\\\ + n \\\\end{center}\\n\\\\end{figure}\\n\\nThree cases can be considered:\\\\\\\ + \\\\n\\n\\\\begin{enumerate}[leftmargin=0pt,label=(\\\\alph*)]\\n\\\\item There\ + \ is no chordless cycle formed by $\\\\stackrel\\\\frown{v_g v_h}$ and by $\\\\\ + stackrel\\\\frown{v_e v_f}$.\\n\\nConsider the cycle $\\\\stackrel\\\\frown{v_g\ + \ v_h} \\\\stackrel\\\\frown{v_e v_f}v_g$ and call it $C$. Now, define a tree\ + \ $T$ with the edge set,\\n$$\\\\,\\\\,\\\\,E\\\\Big(\\\\langle V(G) \\\\setminus\ + \ V(C)\\\\rangle \\\\Big) \\\\bigcap \\\\Big( E(P) \\\\cup \\\\{v_1v_{j}, v_{i}v_n\\\ + \\} \\\\Big),$$\\napply \\\\hyperref[lemma:1]{Lemma 1} $\\\\,$for the partition\ + \ $\\\\{T, C\\\\}$.\\\\\\\\\\n\\n\\\\item With no loss of generality, there exists\ + \ a chordless cycle formed by $\\\\stackrel\\\\frown{v_e v_f}$ and there is no\ + \ chordless cycle formed by the path $\\\\stackrel\\\\frown{v_g v_h}$. First suppose\ + \ that there is a chordless cycle $C_1$ formed by $\\\\stackrel\\\\frown{v_e v_f}$\ + \ such that there is no edge between $V(C_1)$ and $\\\\{v_1, \\\\dots, v_j\\\\\ + }$. By \\\\hyperref[remark:1]{Remark 1} $,$ there exists a chordless cycle $C_2$\ + \ formed by $\\\\stackrel\\\\frown{v_1 v_j}$. By assumption there is no edge between\ + \ $V(C_1)$ and $V(C_2)$. Now, define a tree $T$ with the edge set,\\n\\n$$\\\\\ + quad\\\\quad\\\\quad\\\\quad E\\\\Big(\\\\langle V(G) \\\\setminus \\\\big(V(C_1)\ + \ \\\\cup V(C_2)\\\\big)\\\\rangle \\\\Big) \\\\bigcap \\\\Big( E(P) \\\\cup \\\ + \\{v_1v_{j}, v_{i}v_n\\\\} \\\\Big),$$\\n\\nand apply \\\\hyperref[lemma:1]{Lemma\ + \ 1} $\\\\,$for the partition $\\\\{T, C_1, C_2\\\\}$.\\n\\n$\\\\;$ Next assume\ + \ that for every cycle $C_r$ formed by $\\\\stackrel\\\\frown{v_e v_f}$, there\ + \ are two vertices $x_r \\\\in V(C_r)$ and $y_r \\\\in \\\\{v_1, \\\\dots, v_j\\\ + \\}$ such that $x_r y_r \\\\in E(G)$. Let $v_e=w_0, w_1, \\\\dots, w_l=v_f$ be\ + \ all vertices of the path $\\\\stackrel\\\\frown{v_e v_f}$ in $P$. Choose the\ + \ shortest path $w_0 w_{i_1} w_{i_2} \\\\dots w_l$ such that $0 < i_1 < i_2 <\ + \ \\\\dots < l$. Consider the cycle $w_0 w_{i_1} \\\\dots w_l \\\\stackrel\\\\\ + frown{v_g v_h}$ and call it $C$. Now, by removing $C$, $q$ vertex disjoint paths\ + \ $Q_1, \\\\dots, Q_q$ which are contained in $\\\\stackrel\\\\frown{v_e v_f}$\ + \ remain. Note that there exists a path of order $2$ in $C$ which by adding this\ + \ path to $Q_i$ we find a cycle $C_{r_i}$, for some $i$. Hence there exists an\ + \ edge $x_{r_i} y_{r_i}$ connecting $Q_i$ to $V(G) \\\\setminus V(\\\\stackrel\\\ + \\frown{v_e v_f})$. We define a tree $T$ whose edge set is the edges,\\n$$\\\\\ + quad\\\\quad\\\\quad\\\\quad\\\\quad\\\\quad E\\\\Big(\\\\langle V(G) \\\\setminus\ + \ V(C)\\\\rangle \\\\Big) \\\\bigcap \\\\Big( E(P) \\\\cup \\\\{v_1v_{j}, v_{i}v_n\\\ + \\} \\\\cup \\\\big\\\\{x_{r_i} y_{r_i} \\\\mid 1 \\\\leq i \\\\leq q\\\\big\\\ + \\} \\\\Big),$$\\nthen apply \\\\hyperref[lemma:1]{Lemma 1} $\\\\,$ on the partition\ + \ $\\\\{T, C\\\\}$.\\\\\\\\\\n\\\\begin{figure}[H]\\n \\\\begin{center}\\n \ + \ \\\\includegraphics[width=90mm]{engImages/deltaNonOverlapping.pdf}\\n \\\ + \\caption{The tree $T$ and the shortest path $w_0 w_{i_1}\\\\dots w_l$}\\n \ + \ \\\\label{fig:delta-non-overlapping}\\n \\\\end{center}\\n\\\\end{figure}\\\ + n\\n\\\\item There are at least two chordless cycles, say $C_1$ and $C_2$ formed\ + \ by the paths $\\\\stackrel\\\\frown{v_g v_h}$ and $\\\\stackrel\\\\frown{v_e\ + \ v_f}$, respectively. Since $|g-h| + |e-f|$ is minimum, there is no edge $xy\ + \ \\\\in E(G)$ with $x \\\\in V(C_1)$ and $y \\\\in V(C_2)$. Now, define a tree\ + \ $T$ with the edge set,\\n$$\\\\quad\\\\quad\\\\quad\\\\quad E\\\\Big( \\\\langle\ + \ V(G) \\\\setminus \\\\big(V(C_1) \\\\cup V(C_2)\\\\big) \\\\rangle \\\\Big)\ + \ \\\\bigcap \\\\Big( E(P) \\\\cup \\\\{v_1 v_{j}, v_{i}v_n\\\\} \\\\Big),$$\\\ + nand apply \\\\hyperref[lemma:1]{Lemma 1} $\\\\,$for the partition $\\\\{T, C_1,\ + \ C_2\\\\}$.\\\\\\\\\\n\\\\end{enumerate}\\n\\n\\\\item \\\\textbf{Subcase 3.}\ + \ There exist exactly two indices $s,t$, $s < j' < i' < t$ such that $v_s v_t\ + \ \\\\in E(G)$ and there are no two other indices $s', t'$ such that $s' < j <\ + \ i < t'$ and $v_{s'} v_{t'} \\\\in E(G)$. We can assume that there is no cycle\ + \ formed by $\\\\stackrel\\\\frown{v_{s+1} v_j}$ or $\\\\stackrel\\\\frown{v_i\ + \ v_{t-1}}$, to see this by symmetry consider a cycle $C$ formed by $\\\\stackrel\\\ + \\frown{v_{s+1} v_j}$. By \\\\hyperref[remark:1]{Remark 1} $\\\\,$ there exist\ + \ chordless cycles $C_1$ formed by $\\\\stackrel\\\\frown{v_{s+1} v_j}$ and $C_2$\ + \ formed by $\\\\stackrel\\\\frown{v_{i} v_n}$. By assumption $v_s v_t$ is the\ + \ only edge such that $s < j$ and $t > i \\\\;$. Therefore, there is no edge\ + \ between $V(C_1)$ and $V(C_2)$. Now, let $T$ be a tree defined by the edge set,\\\ + n$$ E\\\\Big(\\\\langle V(G) \\\\setminus \\\\big(V(C_1) \\\\cup V(C_2)\\\\big)\\\ + \\rangle \\\\Big) \\\\bigcap \\\\Big( E(P) \\\\cup \\\\{v_1v_{j}, v_{i}v_n\\\\\ + } \\\\Big),$$\\nand apply \\\\hyperref[lemma:1]{Lemma 1} $\\\\,$for the partition\ + \ \\\\{$T$, $C_1$, $C_2$\\\\}.\\\\\\\\\\n\\n$\\\\quad$Furthermore, we can also\ + \ assume that either $s \\\\neq j'-1$ or $t \\\\neq i'+1$, otherwise we have\ + \ the Hamiltonian cycle $\\\\stackrel\\\\frown{v_1 v_s} \\\\stackrel\\\\frown{v_t\ + \ v_n} \\\\stackrel\\\\frown{v_{i'} v_{j'}} v_1$ and by \\\\cite[Theorem 9]{akbari}\ + \ Conjecture \\\\theconjecture$\\\\,$ holds.\\n\\n$\\\\quad$By symmetry, suppose\ + \ that $s \\\\neq j'-1$. Let $v_k$ be the vertex adjacent to $v_{j'-1}$, and $k\ + \ \\\\notin \\\\{j'-2, j'\\\\}$. It can be shown that $k > j'-1$, since otherwise\ + \ by considering the Hamiltonian path $P': \\\\; \\\\stackrel\\\\frown{ v_{k+1}\ + \ v_{j'-1}}\\\\stackrel\\\\frown{v_k v_1} \\\\stackrel\\\\frown{v_{j'} v_n}$,\ + \ the new $i'-j'$ is greater than the old one and this contradicts our assumption\ + \ about $P$ in the \\\\hyperref[case:2]{Case 2}.\\n\\n$\\\\quad$We know that $j'\ + \ < k < i$. Moreover, the fact that $\\\\stackrel\\\\frown{v_{s+1} v_j}$ does\ + \ not form a cycle contradicts the case that $j' < k \\\\le j$. So $j < k < i$.\ + \ Consider two cycles $C_1$ and $C_2$, respectively with the vertices $v_1 \\\\\ + stackrel\\\\frown{v_{j'} v_{j}} v_1$ and $v_n \\\\stackrel\\\\frown{v_{i'} v_{i}}\ + \ v_n$. The cycles $C_1$ and $C_2$ are chordless, otherwise there exist cycles\ + \ formed by the paths $\\\\stackrel\\\\frown{v_{s+1} v_j}$ or $\\\\stackrel\\\\\ + frown{v_i v_{t-1}}$. Now, define a tree $T$ with the edge set\\n$$ E\\\\Big(\\\ + \\langle V(G) \\\\setminus \\\\big(V(C_1) \\\\cup V(C_2)\\\\big)\\\\rangle \\\\\ + Big) \\\\bigcap \\\\Big( E(P) \\\\cup \\\\{v_s v_t, v_k v_{j'-1}\\\\} \\\\Big),$$\\\ + nand apply \\\\hyperref[lemma:1]{Lemma 1} $\\\\,$for the partition \\\\{$T$, $C_1$,\ + \ $C_2$\\\\}.\\n\\\\end{enumerate}\\n\\\\end{enumerate}\\n\\\\end{proof}\\n\\\ + n\\\\noindent\\\\textbf{Remark 2.}\\n\\\\label{remark:2}\\nIndeed, in the proof\ + \ of the previous theorem we showed a stronger result, that is, for every traceable\ + \ cubic graph there is a decomposition with at most two cycles.\\n\\n\",\n \ + \ \"id\": \"b7c40b41b7eedaa408f87d154284a1aba126589c\",\n \"metadata\"\ + : {\n \"file_path\": \"/home/ubuntu/dolma-v1_7/arxiv-0000.json.gz\"\n \ + \ }\n },\n \"truncated_cells\": []\n },\n {\n \"row_idx\": 1,\n\ + \ \"row\": {\n \"text\": \"\\\\section{Principle of nano strain-amplifier}\\\ + r\\n\\r\\n\\\\begin{figure*}[t!]\\r\\n\\t\\\\centering\\r\\n\\t\\\\includegraphics[width=5.4in]{Fig1}\\\ + r\\n\\t\\t\\\\vspace{-0.5em}\\r\\n\\t\\\\caption{Schematic sketches of nanowire\ + \ strain sensors. (a)(b) Conventional non-released and released NW structure;\ + \ \\r\\n\\t\\t(c)(d) The proposed nano strain-amplifier and its simplified physical\ + \ model.}\\r\\n\\t\\\\label{fig:fig1}\\r\\n\\t\\t\\\\vspace{-1em}\\r\\n\\\\end{figure*}\\\ + r\\nFigure \\\\ref{fig:fig1}(a) and 1(b) show the concept of the conventional\ + \ structures of piezoresistive sensors. The piezoresistive elements are either\ + \ released from, or kept on, the substrate. The sensitivity ($S$) of the sensors\ + \ is defined based on the ratio of the relative resistance change ($\\\\Delta\ + \ R/R$) of the sensing element and the strain applied to the substrate ($\\\\\ + varepsilon_{sub}$):\\r\\n\\\\begin{equation}\\r\\nS = (\\\\Delta R/R)/\\\\varepsilon_{sub}\\\ + r\\n\\\\label{eq:sensitivity}\\r\\n\\\\end{equation}\\r\\nIn addition, the relative\ + \ resistance change $\\\\Delta R/R$ can be calculated from the gauge factor ($GF$)\ + \ of the material used to make the piezoresistive elements: $\\\\Delta R/R = GF\ + \ \\\\varepsilon_{ind}$, where $\\\\varepsilon_{ind}$ is the strain induced into\ + \ the piezoresistor. In most of the conventional strain gauges as shown in Fig.\ + \ \\\\ref{fig:fig1} (a,b), the thickness of the sensing layer is typically below\ + \ a few hundred nanometers, which is much smaller than that of the substrate.\ + \ Therefore, the strain induced into the piezoresistive elements is approximately\ + \ the same as that of the substrate ($\\\\varepsilon_{ind} \\\\approx \\\\varepsilon_{sub}$).\ + \ Consequently, to improve the sensitivity of strain sensors (e.g. enlarging $\\\ + \\Delta R/R$), electrical approaches which can enlarge the gauge factor ($GF$)\ + \ are required. Nevertheless, as aforementioned, the existence of the large gauge\ + \ factor in nanowires due to quantum confinement or surface state, is still considered\ + \ as controversial. \\n\\r\\nIt is also evident from Eq. \\\\ref{eq:sensitivity}\ + \ that the sensitivity of strain sensors can also be improved using a mechanical\ + \ approach, which enlarges the strain induced into the piezoresistive element.\ + \ Figure \\\\ref{fig:fig1}(c) shows our proposed nano strain-amplifier structure,\ + \ in which the piezoresistive nanowires are locally fabricated at the centre of\ + \ a released bridge. The key idea of this structure is that, under a certain strain\ + \ applied to the substrate, a large strain will be concentrated at the locally\ + \ fabricated SiC nanowires. The working principle of the nano strain-amplifier\ + \ is similar to that of the well-known dogbone structure, which is widely used\ + \ to characterize the tensile strength of materials \\\\cite{dogbone1,dogbone2}.\ + \ That is, when a stress is applied to the dogbone-shape of a certain material,\ + \ a crack, if generated, will occur at the middle part of the dogbone. The large\ + \ strain concentrated at the narrow area located at the centre part with respect\ + \ to the wider areas located at outer region, causes the crack. Qualitative and\ + \ quantitative explanations of the nano strain-amplifier are presented as follows.\ + \ \\r\\n\\r\\nFor the sake of simplicity, the released micro frame and nanowire\ + \ (single wire or array) of the nano strain-amplifier can be considered as solid\ + \ springs, Fig. \\\\ref{fig:fig1}(d). The stiffness of these springs are proportional\ + \ to their width ($w$) and inversely proportional to their length (l): $K \\\\\ + propto w/l$. Consequently, the model of the released nanowire and micro frames\ + \ can be simplified as a series of springs, where the springs with higher stiffness\ + \ correspond to the micro frame, and the single spring with lower stiffness corresponds\ + \ to the nanowire. It is well-known in classical physics that, for serially connected\ + \ springs, a larger strain will be concentrated in the low--stiffness string,\ + \ while a smaller strain will be induced in the high--stiffness string \\\\cite{Springbook}.\ + \ The following analysis quantitatively explained the amplification of the strain.\\\ + t\\r\\n\\r\\n\\\\begin{figure}[b!]\\r\\n\\t\\\\centering\\r\\n\\t\\\\includegraphics[width=3in]{Fig2}\\\ + r\\n\\t\\\\vspace{-1em}\\r\\n\\t\\\\caption{Finite element analysis of the strain\ + \ induced in to the nanowire array utilizing nano strain-amplifier.}\\r\\n\\t\\\ + \\label{fig:fig2}\\r\\n\\\\end{figure}\\r\\nWhen a tensile mechanical strain ($\\\ + \\varepsilon_{sub}$) is applied to the substrate, the released structure will\ + \ also be elongated. Since the stiffness of the released frame is much smaller\ + \ than that of the substrate, it is safe to assume that the released structure\ + \ will follows the elongation of the substrate. The displacement of the released\ + \ structure $\\\\Delta L$ is:\\r\\n\\\\begin{equation}\\r\\n\\\\Delta L = \\\\\ + Delta L_m + \\\\Delta L_n = L_m \\\\varepsilon_m + L_n \\\\varepsilon_n\\r\\n\\\ + \\label{eq:displacement}\\r\\n\\\\end{equation} \\r\\nwhere $L_m$, $L_n$ are the\ + \ length; $\\\\Delta L_m$, $\\\\Delta L_n$ are the displacement; and $\\\\varepsilon_m$,\ + \ $\\\\varepsilon_n$ are the strains induced into the micro spring and nano spring,\ + \ respectively. The subscripts m and n stand for the micro frames and nanowires,\ + \ respectively. Furthermore, due to the equilibrium of the stressing force ($F$)\ + \ along the series of springs, the following relationship is established: $F=\ + \ K_m\\\\Delta L_m = K_n \\\\Delta L_n$, where $K_m$, $K_n$ are the stiffness\ + \ of the released micro frames and nanowires, respectively. Consequently the relationship\ + \ between the displacement of the micro frame (higher stiffness) and nanowires\ + \ (lower stiffness) is:\\r\\n\\\\begin{equation}\\r\\n\\\\frac{\\\\Delta L_m}{\\\ + \\Delta L_n}=\\\\frac{K_n}{K_m}=\\\\frac{L_mw_n}{L_nw_m}\\r\\n\\\\label{eq:euili}\\\ + r\\n\\\\end{equation}\\r\\nSubstituting Eqn. \\\\ref{eq:euili} into Eqn. \\\\\ + ref{eq:displacement}, the strain induced into the locally fabricated nanowires\ + \ is:\\r\\n\\\\begin{equation}\\r\\n\\\\varepsilon_n = \\\\frac{\\\\Delta L_n}{L_n}\ + \ = \\\\frac{1}{1-\\\\frac{w_m-w_n}{w_m}\\\\frac{L_m}{L}}\\\\varepsilon_{sub}\\\ + r\\n\\\\label{eq:strainamp}\\r\\n\\\\end{equation} \\r\\n\\r\\nEquation \\\\ref{eq:strainamp}\ + \ indicates that increasing the ratio of $w_m/w_n$ and $L_m/L_n$ significantly\ + \ amplifies the strain induced into the nanowire from the strain applied to the\ + \ substrate. This model is also applicable to the case of nanowire arrays, in\ + \ which $w_n$ is the total width of all nanowires in the array.\\n\\r\\nThe theoretical\ + \ model is then verified using the finite element analysis (FEA). In the FEA simulation,\ + \ we compare the strain induced into (i) non released nanowires, (ii) the conventionally\ + \ released nanowires, and (iii) our nano strain-amplifier structure, using COMSOL\ + \ Multiphysics \\\\texttrademark. In our nano strain amplifying structure, the\ + \ width of the released frame was set to be 8 $\\\\mu$m, while the width of each\ + \ nanowire in the array (3 wires) was set to be 370 nm. The nanowires array structure\ + \ was selected as it can enhance the electrical conductance of the SiC nanowires\ + \ resistor which makes the subsequent experimental demonstration easier. The ratio\ + \ between the length of nanowires and micro bridge was set to be 1: 20. With this\ + \ geometrical dimensions, strain induced into nanowires array $\\\\varepsilon_n$\ + \ was numerically calculated to be approximately 6 times larger than $\\\\varepsilon_{sub}$,\ + \ Eqn. \\\\ref{eq:strainamp}. The simulation results show that for all structure,\ + \ the elongation of non-released and released nanowires follow that of the substrate.\ + \ In addition, strain was almost completely transferred into conventional released\ + \ and non-released structures. Furthermore, the ratio of the strain induced in\ + \ to the locally fabricated nanowires was estimated to be 5.9 times larger than\ + \ that of the substrate, Fig. \\\\ref{fig:fig2}. These results are in solid agreement\ + \ with the theoretical analysis presented above. For a nanowire array with an\ + \ average width of 470 nm, the amplified gain of strain was found to be 4.5. \ + \ \\t\\r\\n\\r\\nBased on the theoretical analysis, we conducted the following\ + \ experiments to demonstrate the high sensitivity of SiC nanowire strain sensors\ + \ using the nano strain-amplifier. A thin 3C-SiC film with its thickness of 300\ + \ nm was epitaxially grown on a 150 mm diameter Si wafer using low pressure chemical\ + \ vapour deposition \\\\cite{SiC_growth}. The film was \\\\emph{in situ} doped\ + \ using Al dopants. The carrier concentration of the p-type 3C-SiC was found to\ + \ be $5 \\\\times 10^{18}$ cm$^{-3}$, using a hot probe technique \\\\cite{philip}.\ + \ The details of the characteristics of the grown film can be found elsewhere\ + \ \\\\cite{Phan_JMC}. Subsequently, I-shape p-type SiC resistors with aluminum\ + \ electrodes deposited on the surface were patterned using inductive coupled plasma\ + \ (ICP) etching. As the piezoresistance of p-type 3C-SiC depends on crystallographic\ + \ orientation, all SiC resistors of the present work were aligned along [110]\ + \ direction to maximize the piezoresistive effect. Next, the micro scale SiC resistors\ + \ were then released from the Si substrate using dry etching (XeF$_2$). Finally,\ + \ SiC nanowire arrays were formed at the centre of the released bridge using focused\ + \ ion beam (FIB). Two types of nanowire array were fabricated with three nanowires\ + \ for each array. The average width of each nanowire in each type were 380 nm\ + \ and 470 nm, respectively. Figure \\\\ref{fig:fig3} shows the SEM images of the\ + \ fabricated samples, including the conventional released structure, non-released\ + \ nanowires, and the nano strain-amplifier. \\r\\n\\r\\n\\\\begin{figure}[t!]\\\ + r\\n\\t\\\\centering\\r\\n\\t\\\\includegraphics[width=3in]{Fig3}\\r\\n\\t\\\\\ + caption{SEM image of SiC strain sensors. (a) Released SiC micro bridge used for\ + \ the subsequent fabrication of the nano strain-amplifier; (b) SEM of a micro\ + \ SiC resistor where the SiC nanowires array were formed using FIB; (c) SEM of\ + \ non-released SiC nanowires; (d) SEM of locally fabricated SiC nanowires released\ + \ from the Si substrate (nano strain-amplifier).}\\r\\n\\t\\\\label{fig:fig3}\\\ + r\\n\\t\\\\vspace{-1em}\\r\\n\\\\end{figure}\\r\\nThe current voltage (I-V) curves\ + \ of all fabricated samples were characterized using a HP 4145 \\\\texttrademark\ + \ ~parameter analyzer. The linear relationship between the applied voltage and\ + \ measured current, indicated that Al made a good Ohmic contact with the highly\ + \ doped SiC resistance, Fig. \\\\ref{fig:IV}. Additionally, the electrical conductivity\ + \ of both nanowires and micro frame estimated from the I-V curve and the dimensions\ + \ of the resistors shows almost the same value. This indicated that the FIB process\ + \ did not cause a significant surface damage to the fabricated nanowires. \\\ + r\\n\\t\\r\\n\\\\begin{figure}[b!]\\r\\n\\t\\\\centering\\r\\n\\t\\\\includegraphics[width=3in]{Fig4}\\\ + r\\n\\t\\t\\\\vspace{-1.5em}\\r\\n\\t\\\\caption{Current voltage curves of the\ + \ fabricated SiC resistors.}\\r\\n\\t\\\\label{fig:IV}\\r\\n\\n\\\\end{figure}\\\ + r\\n\\r\\nThe bending experiment was used to characterize the piezoresistive effect\ + \ in micro size SiC resistors and locally fabricated SiC nanowire array. In this\ + \ experiment one end of the Si cantilever (with a thickness of 625 $\\\\mu$m,\ + \ and a width of 7 mm) was fixed while the other end was deflected by applying\ + \ different forces. The distance from the fabricated nanowires to the free end\ + \ of the Si cantilever was approximately 45 mm. The strain induced into the Si\ + \ substrate is $\\\\varepsilon_\\\\text{sub} = Mt/2EI$, where $M$ is the applied\ + \ bending moment; and $t$, $E$ and $I$ are the thickness, Young's modulus and\ + \ the moment of inertia of the Si cantilever, respectively. The response of the\ + \ SiC resistance to applied strain was then measured using a multimeter (Agilent\ + \ \\\\texttrademark 34401 A).\\n\\r\\n\\\\begin{figure}[h!]\\r\\n\\t\\\\centering\\\ + r\\n\\t\\\\includegraphics[width=3in]{Fig5.eps}\\r\\n\\t\\t\\\\vspace{-1.5em}\\\ + r\\n\\t\\\\caption{Experimental results. (a) A comparision between the relative\ + \ resistance change in the nano strain-amplifiers, non released nanowires and\ + \ released micro frames; (b) The repeatability of the SiC nanowires strain sensors\ + \ utilizing the proposed structure.}\\r\\n\\t\\\\label{fig:DRR}\\r\\n\\t\\t\\\ + t\\\\vspace{-1em}\\r\\n\\\\end{figure}\\t\\r\\nThe relative resistance change\ + \ ($\\\\Delta R/R$) of the micro and nano SiC resistors was plotted against the\ + \ strain induced into the Si substrate $\\\\varepsilon_{sub}$, Fig. \\\\ref{fig:DRR}(a).\ + \ For all fabricated samples, the relative resistance change shows a good linear\ + \ relationship with the applied strain ($\\\\varepsilon_{sub}$). In addition,\ + \ with the same applied strain to the Si substrate, the resistance change of the\ + \ SiC nanowires using the nano strain-amplifier was much larger than that of the\ + \ the SiC micro resistor and the conventional non-released SiC nanowires. In addition,\ + \ reducing the width of the SiC nanowires also resulted in the increase of the\ + \ sensitivity. The magnitude of the piezoresistive effect in the nano strain-amplifier\ + \ as well as conventional structures were then quantitatively evaluated based\ + \ on the effective gauge factor ($GF_{eff}$), which is defined as the ratio of\ + \ the relative resistance change to the applied strain to the substrate: $GF_{eff}\ + \ = (\\\\Delta R/R)/\\\\varepsilon_{sub}$. Accordingly, the effective gauge factor\ + \ of the released micro SiC was found to be 28, while that of the non-released\ + \ SiC nanowires was 35. From the data shown in Fig. \\\\ref{fig:DRR}, the effective\ + \ gauge factor of the 380 nm and 470 nm SiC nanowires in the nano strain-amplifier\ + \ were calculated as 150 and 124, respectively. Thus for nanowire arrays with\ + \ average widths of 380 nm and 470 nm, the sensitivity of the nano strain-amplifier\ + \ was 5.4 times and 4.6 times larger than the bulk SiC, respectively. These results\ + \ were consistent with analytical and numerical models presented above. The relative\ + \ resistance change of the nano strain-amplifier also showed excellent linearity\ + \ with the applied strain, with a linear regression of above 99\\\\%. \\r\\n\\\ + r\\nThe resistance change of the nano strain-amplifier can also be converted into\ + \ voltage signals using a Wheatstone bridge, Fig. \\\\ref{fig:DRR}(b). The output\ + \ voltage of the nano strain-amplifier increases with increasing tensile strains\ + \ from 0 ppm to 180 ppm, and returned to the initial value when the strain was\ + \ completely removed, confirming a good repeatability after several strain induced\ + \ cycles. The linearity of the relative resistance change, and the repeatability\ + \ indicate that the proposed structure is promising for strain sensing applications.\\\ + r\\n \\r\\nIn conclusion, this work presents a novel mechanical approach to\ + \ obtain highly sensitive piezoresistance in nanowires based on a nano strain-amplifier.\ + \ The key factor of the nano strain-amplifier lies on nanowires locally fabricated\ + \ on a released micro structure. Experimental studies were conducted on SiC nanowires,\ + \ confirming that by utilizing our nano strain-amplifier, the sensitivity of SiC\ + \ nanowires was 5.4 times larger than that of conventional structures. This result\ + \ indicated that the nano strain-amplifier is an excellent platform for ultra\ + \ sensitive strain sensing applications. \\r\\n\\r\\n\\r\\n\",\n \"id\"\ + : \"1b77ae9f541b19668cc96624c7ec0f83945284e2\",\n \"metadata\": {\n \ + \ \"file_path\": \"/home/ubuntu/dolma-v1_7/arxiv-0000.json.gz\"\n }\n \ + \ },\n \"truncated_cells\": []\n }\n]" + - "HUB_DATASET_PREVIEW: DATASET_NAME: \"KelvinTichana2/mentalhealthcurated\"\nFEATURES:\ + \ {'Human': {'dtype': 'string', '_type': 'Value'}, 'Assistant': {'dtype': 'string',\ + \ '_type': 'Value'}}\nDATA SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\":\ + \ {\n \"Human\": \"hello, hey, hi, good day, greetings, what's up?, how is\ + \ it going\",\n \"Assistant\": \"Hello! How are you today!, Hey! What's up,\ + \ Hey, How are you feeling today\"\n },\n \"truncated_cells\": []\n },\n\ + \ {\n \"row_idx\": 1,\n \"row\": {\n \"Human\": \"cya, see you later,\ + \ goodbye, Have a good day, bye, I am leaving\",\n \"Assistant\": \"Talk\ + \ to you later!, Bye!, Goodbye!\"\n },\n \"truncated_cells\": []\n }\n]" +- source_sentence: 'USER_QUERY: named entity recognition dataset conll2003' + sentences: + - "NEGATIVE: DATASET_NAME: \"whoisjones/litset\"\nFEATURES: {'id': {'dtype': 'int64',\ + \ '_type': 'Value'}, 'tokens': {'feature': {'dtype': 'string', '_type': 'Value'},\ + \ '_type': 'Sequence'}, 'ner_tags': {'feature': {'dtype': 'int64', '_type': 'Value'},\ + \ '_type': 'Sequence'}}\nDATA SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\"\ + : {\n \"id\": 1,\n \"tokens\": [\n \"A\",\n \"few\",\n\ + \ \"examples\",\n \"of\",\n \"autistic\",\n \"symptoms\"\ + ,\n \"and\",\n \"treatments\",\n \"were\",\n \"described\"\ + ,\n \"long\",\n \"before\",\n \"autism\",\n \"was\"\ + ,\n \"named\",\n \".\"\n ],\n \"ner_tags\": [\n \ + \ 0,\n 0,\n 0,\n 0,\n 0,\n 0,\n 0,\n\ + \ 0,\n 0,\n 0,\n 0,\n 0,\n 0,\n \ + \ 0,\n 0,\n 0\n ]\n },\n \"truncated_cells\": []\n\ + \ },\n {\n \"row_idx\": 1,\n \"row\": {\n \"id\": 2,\n \"tokens\"\ + : [\n \"The\",\n \"Table\",\n \"Talk\",\n \"of\",\n\ + \ \"Martin\",\n \"Luther\",\n \",\",\n \"compiled\"\ + ,\n \"by\",\n \"his\",\n \"notetaker\",\n \",\",\n\ + \ \"Mathesius\",\n \",\",\n \"contains\",\n \"the\"\ + ,\n \"story\",\n \"of\",\n \"a\",\n \"12\",\n \ + \ \"year\",\n \"old\",\n \"boy\",\n \"who\",\n \ + \ \"may\",\n \"have\",\n \"been\",\n \"severely\",\n \ + \ \"autistic\",\n \".\"\n ],\n \"ner_tags\": [\n 0,\n\ + \ 717291,\n 717291,\n 0,\n 578735,\n 578735,\n\ + \ 0,\n 0,\n 0,\n 0,\n 0,\n 0,\n \ + \ 0,\n 0,\n 0,\n 0,\n 0,\n 0,\n 0,\n\ + \ 0,\n 0,\n 0,\n 0,\n 0,\n 0,\n \ + \ 0,\n 0,\n 0,\n 0,\n 0\n ]\n },\n \"\ + truncated_cells\": []\n }\n]" + - "HUB_DATASET_PREVIEW: DATASET_NAME: \"ZhongshengWang/alpaca-booksum\"\nFEATURES:\ + \ {'input': {'dtype': 'string', '_type': 'Value'}, 'output': {'dtype': 'string',\ + \ '_type': 'Value'}, 'instruction': {'dtype': 'string', '_type': 'Value'}}\nDATA\ + \ SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\": {\n \"instruction\"\ + : \"Please complete the task of abstracting and extracting text content from different\ + \ domains, where input is the content of the article and output is the result\ + \ of the summary.\",\n \"input\": \"\\n \\\"Mine ear is open, and my heart\ + \ prepared:\\n The worst is worldly loss thou canst unfold:\\n Say, is my kingdom\ + \ lost?\\\"\\n\\n SHAKESPEARE.\\n\\n\\nIt was a feature peculiar to the colonial\ + \ wars of North America, that\\nthe toils and dangers of the wilderness were to\ + \ be encountered before\\nthe adverse hosts could meet. A wide and apparently\ + \ an impervious\\nboundary of forests severed the possessions of the hostile provinces\ + \ of\\nFrance and England. The hardy colonist, and the trained European who\\\ + nfought at his side, frequently expended months in struggling against the\\nrapids\ + \ of the streams, or in effecting the rugged passes of the\\nmountains, in quest\ + \ of an opportunity to exhibit their courage in a more\\nmartial conflict. But,\ + \ emulating the patience and self-denial of the\\npractised native warriors, they\ + \ learned to overcome every difficulty;\\nand it would seem that, in time, there\ + \ was no recess of the woods so\\ndark, nor any secret place so lovely, that it\ + \ might claim exemption from\\nthe inroads of those who had pledged their blood\ + \ to satiate their\\nvengeance, or to uphold the cold and selfish policy of the\ + \ distant\\nmonarchs of Europe.\\n\\nPerhaps no district throughout the wide extent\ + \ of the intermediate\\nfrontiers can furnish a livelier picture of the cruelty\ + \ and fierceness\\nof the savage warfare of those periods than the country which\ + \ lies\\nbetween the head waters of the Hudson and the adjacent lakes.\\n\\nThe\ + \ facilities which nature had there offered to the march of the\\ncombatants were\ + \ too obvious to be neglected. The lengthened sheet of the\\nChamplain stretched\ + \ from the frontiers of Canada, deep within the\\nborders of the neighboring province\ + \ of New York, forming a natural\\npassage across half the distance that the French\ + \ were compelled to\\nmaster in order to strike their enemies. Near its southern\ + \ termination,\\nit received the contributions of another lake, whose waters were\ + \ so\\nlimpid as to have been exclusively selected by the Jesuit missionaries\\\ + nto perform the typical purification of baptism, and to obtain for it the\\ntitle\ + \ of lake \\\"du Saint Sacrement.\\\" The less zealous English thought\\nthey\ + \ conferred a sufficient honor on its unsullied fountains, when they\\nbestowed\ + \ the name of their reigning prince, the second of the house of\\nHanover. The\ + \ two united to rob the untutored possessors of its wooded\\nscenery of their\ + \ native right to perpetuate its original appellation of\\n\\\"Horican.\\\"[1]\\\ + n\\nWinding its way among countless islands, and imbedded in mountains, the\\\ + n\\\"holy lake\\\" extended a dozen leagues still farther to the south. With\\\ + nthe high plain that there interposed itself to the further passage of\\nthe water,\ + \ commenced a portage of as many miles, which conducted the\\nadventurer to the\ + \ banks of the Hudson, at a point where, with the usual\\nobstructions of the\ + \ rapids, or rifts, as they were then termed in the\\nlanguage of the country,\ + \ the river became navigable to the tide.\\n\\nWhile, in the pursuit of their\ + \ daring plans of annoyance, the restless\\nenterprise of the French even attempted\ + \ the distant and difficult gorges\\nof the Alleghany, it may easily be imagined\ + \ that their proverbial\\nacuteness would not overlook the natural advantages\ + \ of the district we\\nhave just described. It became, emphatically, the bloody\ + \ arena, in which\\nmost of the battles for the mastery of the colonies were contested.\\\ + nForts were erected at the different points that commanded the facilities\\nof\ + \ the route, and were taken and retaken, razed and rebuilt, as victory\\nalighted\ + \ on the hostile banners. While the husbandman shrank back from\\nthe dangerous\ + \ passes, within the safer boundaries of the more ancient\\nsettlements, armies\ + \ larger than those that had often disposed of the\\nsceptres of the mother countries,\ + \ were seen to bury themselves in these\\nforests, whence they rarely returned\ + \ but in skeleton bands, that were\\nhaggard with care, or dejected by defeat.\ + \ Though the arts of peace were\\nunknown to this fatal region, its forests were\ + \ alive with men; its\\nshades and glens rang with the sounds of martial music,\ + \ and the echoes\\nof its mountains threw back the laugh, or repeated the wanton\ + \ cry, of\\nmany a gallant and reckless youth, as he hurried by them, in the\\\ + nnoontide of his spirits, to slumber in a long night of forgetfulness.\\n\\nIt\ + \ was in this scene of strife and bloodshed that the incidents we shall\\nattempt\ + \ to relate occurred, during the third year of the war which\\nEngland and France\ + \ last waged for the possession of a country that\\nneither was destined to retain.\\\ + n\\nThe imbecility of her military leaders abroad, and the fatal want of\\nenergy\ + \ in her councils at home, had lowered the character of Great\\nBritain from the\ + \ proud elevation on which it had been placed, by the\\ntalents and enterprise\ + \ of her former warriors and statesmen. No longer\\ndreaded by her enemies, her\ + \ servants were fast losing the confidence of\\nself-respect. In this mortifying\ + \ abasement, the colonists, though\\ninnocent of her imbecility, and too humble\ + \ to be the agents of her\\nblunders, were but the natural participators.\\n\\\ + nThey had recently seen a chosen army from that country, which,\\nreverencing\ + \ as a mother, they had blindly believed invincible--an army\\nled by a chief\ + \ who had been selected from a crowd of trained warriors,\\nfor his rare military\ + \ endowments, disgracefully routed by a handful of\\nFrench and Indians, and only\ + \ saved from annihilation by the coolness and\\nspirit of a Virginian boy, whose\ + \ riper fame has since diffused itself,\\nwith the steady influence of moral truth,\ + \ to the uttermost confines of\\nChristendom.[2] A wide frontier had been laid\ + \ naked by this unexpected\\ndisaster, and more substantial evils were preceded\ + \ by a thousand\\nfanciful and imaginary dangers. The alarmed colonists believed\ + \ that the\\nyells of the savages mingled with every fitful gust of wind that\ + \ issued\\nfrom the interminable forests of the west. The terrific character of\\\ + ntheir merciless enemies increased immeasurably the natural horrors of\\nwarfare.\ + \ Numberless recent massacres were still vivid in their\\nrecollections; nor was\ + \ there any ear in the provinces so deaf as not to\\nhave drunk in with avidity\ + \ the narrative of some fearful tale of\\nmidnight murder, in which the natives\ + \ of the forests were the principal\\nand barbarous actors. As the credulous and\ + \ excited traveller related the\\nhazardous chances of the wilderness, the blood\ + \ of the timid curdled\\nwith terror, and mothers cast anxious glances even at\ + \ those children\\nwhich slumbered within the security of the largest towns. In\ + \ short, the\\nmagnifying influence of fear began to set at naught the calculations\ + \ of\\nreason, and to render those who should have remembered their manhood,\\\ + nthe slaves of the basest of passions. Even the most confident and the\\nstoutest\ + \ hearts began to think the issue of the contest was becoming\\ndoubtful; and\ + \ that abject class was hourly increasing in numbers, who\\nthought they foresaw\ + \ all the possessions of the English crown in America\\nsubdued by their Christian\ + \ foes, or laid waste by the inroads of their\\nrelentless allies.\\n\\nWhen,\ + \ therefore, intelligence was received at the fort, which covered\\nthe southern\ + \ termination of the portage between the Hudson and the\\nlakes, that Montcalm\ + \ had been seen moving up the Champlain, with an army\\n\\\"numerous as the leaves\ + \ on the trees,\\\" its truth was admitted with more\\nof the craven reluctance\ + \ of fear than with the stern joy that a warrior\\nshould feel, in finding an\ + \ enemy within reach of his blow. The news had\\nbeen brought, towards the decline\ + \ of a day in midsummer, by an Indian\\nrunner, who also bore an urgent request\ + \ from Munro, the commander of a\\nwork on the shore of the \\\"holy lake,\\\"\ + \ for a speedy and powerful\\nreinforcement. It has already been mentioned that\ + \ the distance between\\nthese two posts was less than five leagues. The rude\ + \ path, which\\noriginally formed their line of communication, had been widened\ + \ for the\\npassage of wagons; so that the distance which had been travelled by\ + \ the\\nson of the forest in two hours, might easily be effected by a detachment\\\ + nof troops, with their necessary baggage, between the rising and setting\\nof\ + \ a summer sun. The loyal servants of the British crown had given to\\none of\ + \ these forest fastnesses the name of William Henry, and to the\\nother that of\ + \ Fort Edward; calling each after a favorite prince of the\\nreigning family.\ + \ The veteran Scotchman just named held the first, with a\\nregiment of regulars\ + \ and a few provincials; a force really by far too\\nsmall to make head against\ + \ the formidable power that Montcalm was\\nleading to the foot of his earthen\ + \ mounds. At the latter, however, lay\\nGeneral Webb, who commanded the armies\ + \ of the king in the northern\\nprovinces, with a body of more than five thousand\ + \ men. By uniting the\\nseveral detachments of his command, this officer might\ + \ have arrayed\\nnearly double that number of combatants against the enterprising\\\ + nFrenchman, who had ventured so far from his reinforcements, with an army\\nbut\ + \ little superior in numbers.\\n\\nBut under the influence of their degraded fortunes,\ + \ both officers and\\nmen appeared better disposed to await the approach of their\ + \ formidable\\nantagonists, within their works, than to resist the progress of\ + \ their\\nmarch, by emulating the successful example of the French at Fort du\\\ + nQuesne, and striking a blow on their advance.\\n\\nAfter the first surprise of\ + \ the intelligence had a little abated, a\\nrumor was spread through the entrenched\ + \ camp, which stretched along the\\nmargin of the Hudson, forming a chain of outworks\ + \ to the body of the\\nfort itself, that a chosen detachment of fifteen hundred\ + \ men was to\\ndepart, with the dawn, for William Henry, the post at the northern\\\ + nextremity of the portage. That which at first was only rumor, soon\\nbecame certainty,\ + \ as orders passed from the quarters of the\\ncommander-in-chief to the several\ + \ corps he had selected for this\\nservice, to prepare for their speedy departure.\ + \ All doubt as to the\\nintention of Webb now vanished, and an hour or two of\ + \ hurried footsteps\\nand anxious faces succeeded. The novice in the military\ + \ art flew from\\npoint to point, retarding his own preparations by the excess\ + \ of his\\nviolent and somewhat distempered zeal; while the more practised veteran\\\ + nmade his arrangements with a deliberation that scorned every appearance\\nof\ + \ haste; though his sober lineaments and anxious eye sufficiently\\nbetrayed that\ + \ he had no very strong professional relish for the as yet\\nuntried and dreaded\ + \ warfare of the wilderness. At length the sun set in\\na flood of glory, behind\ + \ the distant western hills, and as darkness drew\\nits veil around the secluded\ + \ spot the sounds of preparation diminished;\\nthe last light finally disappeared\ + \ from the log cabin of some officer;\\nthe trees cast their deeper shadows over\ + \ the mounds and the rippling\\nstream, and a silence soon pervaded the camp,\ + \ as deep as that which\\nreigned in the vast forest by which it was environed.\\\ + n\\nAccording to the orders of the preceding night, the heavy sleep of the\\narmy\ + \ was broken by the rolling of the warning drums, whose rattling\\nechoes were\ + \ heard issuing, on the damp morning air, out of every vista\\nof the woods, just\ + \ as day began to draw the shaggy outlines of some tall\\npines of the vicinity,\ + \ on the opening brightness of a soft and cloudless\\neastern sky. In an instant\ + \ the whole camp was in motion; the meanest\\nsoldier arousing from his lair to\ + \ witness the departure of his\\ncomrades, and to share in the excitement and\ + \ incidents of the hour. The\\nsimple array of the chosen band was soon completed.\ + \ While the regular\\nand trained hirelings of the king marched with haughtiness\ + \ to the right\\nof the line, the less pretending colonists took their humbler\ + \ position\\non its left, with a docility that long practice had rendered easy.\ + \ The\\nscouts departed; strong guards preceded and followed the lumbering\\nvehicles\ + \ that bore the baggage; and before the gray light of the morning\\nwas mellowed\ + \ by the rays of the sun, the main body of the combatants\\nwheeled into column,\ + \ and left the encampment with a show of high\\nmilitary bearing, that served\ + \ to drown the slumbering apprehensions of\\nmany a novice, who was now about\ + \ to make his first essay in arms. While\\nin view of their admiring comrades,\ + \ the same proud front and ordered\\narray was observed, until the notes of their\ + \ fifes growing fainter in\\ndistance, the forest at length appeared to swallow\ + \ up the living mass\\nwhich had slowly entered its bosom.\\n\\nThe deepest sounds\ + \ of the retiring and invisible column had ceased to be\\nborne on the breeze\ + \ to the listeners, and the latest straggler had\\nalready disappeared in pursuit;\ + \ but there still remained the signs of\\nanother departure, before a log cabin\ + \ of unusual size and\\naccommodations, in front of which those sentinels paced\ + \ their rounds,\\nwho were known to guard the person of the English general. At\ + \ this spot\\nwere gathered some half dozen horses, caparisoned in a manner which\\\ + nshowed that two, at least, were destined to bear the persons of females,\\nof\ + \ a rank that it was not usual to meet so far in the wilds of the\\ncountry. A\ + \ third wore the trappings and arms of an officer of the staff;\\nwhile the rest,\ + \ from the plainness of the housings, and the travelling\\nmails with which they\ + \ were encumbered, were evidently fitted for the\\nreception of as many menials,\ + \ who were, seemingly, already awaiting the\\npleasure of those they served. At\ + \ a respectful distance from this\\nunusual show were gathered divers groups of\ + \ curious idlers; some\\nadmiring the blood and bone of the high-mettled military\ + \ charger, and\\nothers gazing at the preparations, with dull wonder of vulgar\ + \ curiosity.\\nThere was one man, however, who, by his countenance and actions,\ + \ formed\\na marked exception to those who composed the latter class of spectators,\\\ + nbeing neither idle, nor seemingly very ignorant.\\n\\nThe person of this individual\ + \ was to the last degree ungainly, without\\nbeing in any particular manner deformed.\ + \ He had all the bones and joints\\nof other men, without any of their proportions.\ + \ Erect, his stature\\nsurpassed that of his fellows; seated, he appeared reduced\ + \ within the\\nordinary limits of the race. The same contrariety in his members\ + \ seemed\\nto exist throughout the whole man. His head was large; his shoulders\\\ + nnarrow; his arms long and dangling; while his hands were small, if not\\ndelicate.\ + \ His legs and thighs were thin, nearly to emaciation, but of\\nextraordinary\ + \ length; and his knees would have been considered\\ntremendous, had they not\ + \ been outdone by the broader foundations on\\nwhich this false superstructure\ + \ of the blended human orders was so\\nprofanely reared. The ill-assorted and\ + \ injudicious attire of the\\nindividual only served to render his awkwardness\ + \ more conspicuous. A\\nsky-blue coat, with short and broad skirts and low cape,\ + \ exposed a long\\nthin neck, and longer and thinner legs, to the worst animadversions\ + \ of\\nthe evil disposed. His nether garment was of yellow nankeen, closely\\\ + nfitted to the shape, and tied at his bunches of knees by large knots of\\nwhite\ + \ ribbon, a good deal sullied by use. Clouded cotton stockings, and\\nshoes, on\ + \ one of the latter of which was a plated spur, completed the\\ncostume of the\ + \ lower extremity of this figure, no curve or angle of\\nwhich was concealed,\ + \ but, on the other hand, studiously exhibited,\\nthrough the vanity or simplicity\ + \ of its owner. From beneath the flap of\\nan enormous pocket of a soiled vest\ + \ of embossed silk, heavily ornamented\\nwith tarnished silver lace, projected\ + \ an instrument, which, from being\\nseen in such martial company, might have\ + \ been easily mistaken for some\\nmischievous and unknown implement of war. Small\ + \ as it was, this uncommon\\nengine had excited the curiosity of most of the Europeans\ + \ in the camp,\\nthough several of the provincials were seen to handle it, not\ + \ only\\nwithout fear, but with the utmost familiarity. A large, civil cocked\\\ + nhat, like those worn by clergymen within the last thirty years,\\nsurmounted\ + \ the whole, furnishing dignity to a good-natured and somewhat\\nvacant countenance,\ + \ that apparently needed such artificial aid, to\\nsupport the gravity of some\ + \ high and extraordinary trust.\\n\\nWhile the common herd stood aloof, in deference\ + \ to the quarters of Webb,\\nthe figure we have described stalked in the centre\ + \ of the domestics,\\nfreely expressing his censures or commendations on the merits\ + \ of the\\nhorses, as by chance they displeased or satisfied his judgment.\\n\\\ + n\\\"This beast, I rather conclude, friend, is not of home raising, but is\\nfrom\ + \ foreign lands, or perhaps from the little island itself over the\\nblue water?\\\ + \" he said, in a voice as remarkable for the softness and\\nsweetness of its tones,\ + \ as was his person for its rare proportions: \\\"I\\nmay speak of these things,\ + \ and be no braggart; for I have been down at\\nboth havens; that which is situate\ + \ at the mouth of Thames, and is named\\nafter the capital of Old England, and\ + \ that which is called 'Haven,' with\\nthe addition of the word 'New'; and have\ + \ seen the snows and brigantines\\ncollecting their droves, like the gathering\ + \ to the ark, being outward\\nbound to the Island of Jamaica, for the purpose\ + \ of barter and traffic in\\nfour-footed animals; but never before have I beheld\ + \ a beast which\\nverified the true Scripture war-horse like this: 'He paweth\ + \ in the\\nvalley, and rejoiceth in his strength: he goeth on to meet the armed\\\ + nmen. He saith among the trumpets, Ha, ha; and he smelleth the battle\\nafar off,\ + \ the thunder of the captains, and the shouting.' It would seem\\nthat the stock\ + \ of the horse of Israel has descended to our own time;\\nwould it not, friend?\\\ + \"\\n\\nReceiving no reply to this extraordinary appeal, which in truth, as it\\\ + nwas delivered with the vigor of full and sonorous tones, merited some\\nsort\ + \ of notice, he who had thus sung forth the language of the Holy Book\\nturned\ + \ to the silent figure to whom he had unwittingly addressed\\nhimself, and found\ + \ a new and more powerful subject of admiration in the\\nobject that encountered\ + \ his gaze. His eyes fell on the still, upright,\\nand rigid form of the \\\"\ + Indian runner,\\\" who had borne to the camp the\\nunwelcome tidings of the preceding\ + \ evening. Although in a state of\\nperfect repose, and apparently disregarding,\ + \ with characteristic\\nstoicism, the excitement and bustle around him, there\ + \ was a sullen\\nfierceness mingled with the quiet of the savage, that was likely\ + \ to\\narrest the attention of much more experienced eyes than those which now\\\ + nscanned him, in unconcealed amazement. The native bore both the tomahawk\\nand\ + \ knife of his tribe; and yet his appearance was not altogether that\\nof a warrior.\ + \ On the contrary, there was an air of neglect about his\\nperson, like that which\ + \ might have proceeded from great and recent\\nexertion, which he had not yet\ + \ found leisure to repair. The colors of\\nthe war-paint had blended in dark confusion\ + \ about his fierce\\ncountenance, and rendered his swarthy lineaments still more\ + \ savage and\\nrepulsive than if art had attempted an effect which had been thus\\\ + nproduced by chance. His eye, alone, which glistened like a fiery star\\namid\ + \ lowering clouds, was to be seen in its state of native wildness.\\nFor a single\ + \ instant, his searching and yet wary glance met the\\nwondering look of the other,\ + \ and then changing its direction, partly in\\ncunning, and partly in disdain,\ + \ it remained fixed, as if penetrating the\\ndistant air.\\n\\nIt is impossible\ + \ to say what unlooked-for remark this short and silent\\ncommunication, between\ + \ two such singular men, might have elicited from\\nthe white man, had not his\ + \ active curiosity been again drawn to other\\nobjects. A general movement among\ + \ the domestics, and a low sound of\\ngentle voices, announced the approach of\ + \ those whose presence alone was\\nwanted to enable the cavalcade to move. The\ + \ simple admirer of the\\nwar-horse instantly fell back to a low, gaunt, switch-tailed\ + \ mare, that\\nwas unconsciously gleaning the faded herbage of the camp nigh by;\ + \ where,\\nleaning with one elbow on the blanket that concealed an apology for\ + \ a\\nsaddle, he became a spectator of the departure, while a foal was quietly\\\ + nmaking its morning repast, on the opposite side of the same animal.\\n\\nA young\ + \ man, in the dress of an officer, conducted to their steeds two\\nfemales, who,\ + \ as it was apparent by their dresses, were prepared to\\nencounter the fatigues\ + \ of a journey in the woods. One, and she was the\\nmost juvenile in her appearance,\ + \ though both were young, permitted\\nglimpses of her dazzling complexion, fair\ + \ golden hair, and bright blue\\neyes, to be caught, as she artlessly suffered\ + \ the morning air to blow\\naside the green veil which descended low from her\ + \ beaver. The flush\\nwhich still lingered above the pines in the western sky\ + \ was not more\\nbright nor delicate than the bloom on her cheek; nor was the\ + \ opening day\\nmore cheering than the animated smile which she bestowed on the\ + \ youth,\\nas he assisted her into the saddle. The other, who appeared to share\\\ + nequally in the attentions of the young officer, concealed her charms\\nfrom the\ + \ gaze of the soldiery, with a care that seemed better fitted to\\nthe experience\ + \ of four or five additional years. It could be seen,\\nhowever, that her person,\ + \ though moulded with the same exquisite\\nproportions, of which none of the graces\ + \ were lost by the travelling\\ndress she wore, was rather fuller and more mature\ + \ than that of her\\ncompanion.\\n\\nNo sooner were these females seated, than\ + \ their attendant sprang lightly\\ninto the saddle of the war-horse, when the\ + \ whole three bowed to Webb,\\nwho, in courtesy, awaited their parting on the\ + \ threshold of his cabin,\\nand turning their horses' heads, they proceeded at\ + \ a slow amble,\\nfollowed by their train, towards the northern entrance of the\\\ + nencampment. As they traversed that short distance, not a voice was\\nheard amongst\ + \ them; but a slight exclamation proceeded from the younger\\nof the females,\ + \ as the Indian runner glided by her, unexpectedly, and\\nled the way along the\ + \ military road in her front. Though this sudden and\\nstartling movement of the\ + \ Indian produced no sound from the other, in\\nthe surprise her veil also was\ + \ allowed to open its folds, and betrayed\\nan indescribable look of pity, admiration,\ + \ and horror, as her dark eye\\nfollowed the easy motions of the savage. The tresses\ + \ of this lady were\\nshining and black, like the plumage of the raven. Her complexion\ + \ was not\\nbrown, but it rather appeared charged with the color of the rich blood,\\\ + nthat seemed ready to burst its bounds. And yet there was neither\\ncoarseness\ + \ nor want of shadowing in a countenance that was exquisitely\\nregular and dignified,\ + \ and surpassingly beautiful. She smiled, as if in\\npity at her own momentary\ + \ forgetfulness, discovering by the act a row of\\nteeth that would have shamed\ + \ the purest ivory; when, replacing the veil,\\nshe bowed her face, and rode in\ + \ silence, like one whose thoughts were\\nabstracted from the scene around her.\\\ + n\\n\\n\\n\\n \\\"Sola, sola, wo, ha, ho, sola!\\\"\\n\\n SHAKESPEARE.\\n\\\ + n\\nWhile one of the lovely beings we have so cursorily presented to the\\nreader\ + \ was thus lost in thought, the other quickly recovered from the\\nalarm which\ + \ induced the exclamation, and, laughing at her own weakness,\\nshe inquired of\ + \ the youth who rode by her side,--\\n\\n\\\"Are such spectres frequent in the\ + \ woods, Heyward; or is this sight an\\nespecial entertainment on our behalf?\ + \ If the latter, gratitude must\\nclose our mouths; but if the former, both Cora\ + \ and I shall have need to\\ndraw largely on that stock of hereditary courage\ + \ which we boast, even\\nbefore we are made to encounter the redoubtable Montcalm.\\\ + \"\\n\\n\\\"Yon Indian is a 'runner' of the army; and, after the fashion of his\\\ + npeople, he may be accounted a hero,\\\" returned the officer. \\\"He has\\nvolunteered\ + \ to guide us to the lake, by a path but little known, sooner\\nthan if we followed\ + \ the tardy movements of the column: and, by\\nconsequence, more agreeably.\\\"\ + \\n\\n\\\"I like him not,\\\" said the lady, shuddering, partly in assumed, yet\ + \ more\\nin real terror. \\\"You know him, Duncan, or you would not trust yourself\\\ + nso freely to his keeping?\\\"\\n\\n\\\"Say, rather, Alice, that I would not trust\ + \ you. I do know him, or he\\nwould not have my confidence, and least of all at\ + \ this moment. He is\\nsaid to be a Canadian, too; and yet he served with our\ + \ friends the\\nMohawks, who, as you know, are one of the six allied nations.[3]\ + \ He was\\nbrought among us, as I have heard, by some strange accident in which\\\ + nyour father was interested, and in which the savage was rigidly dealt\\nby--but\ + \ I forget the idle tale; it is enough, that he is now our\\nfriend.\\\"\\n\\\ + n\\\"If he has been my father's enemy, I like him still less!\\\" exclaimed the\\\ + nnow really anxious girl. \\\"Will you not speak to him, Major Heyward, that\\\ + nI may hear his tones? Foolish though it may be, you have often heard me\\navow\ + \ my faith in the tones of the human voice!\\\"\\n\\n\\\"It would be in vain;\ + \ and answered, most probably, by an ejaculation.\\nThough he may understand it,\ + \ he affects, like most of his people, to be\\nignorant of the English; and least\ + \ of all will he condescend to speak\\nit, now that war demands the utmost exercise\ + \ of his dignity. But he\\nstops; the private path by which we are to journey\ + \ is, doubtless, at\\nhand.\\\"\\n\\nThe conjecture of Major Heyward was true.\ + \ When they reached the spot\\nwhere the Indian stood, pointing into the thicket\ + \ that fringed the\\nmilitary road, a narrow and blind path, which might, with\ + \ some little\\ninconvenience, receive one person at a time, became visible.\\\ + n\\n\\\"Here, then, lies our way,\\\" said the young man, in a low voice.\\n\\\ + \"Manifest no distrust, or you may invite the danger you appear to\\napprehend.\\\ + \"\\n\\n\\\"Cora, what think you?\\\" asked the reluctant fair one. \\\"If we\ + \ journey\\nwith the troops, though we may find their presence irksome, shall\ + \ we not\\nfeel better assurance of our safety?\\\"\\n\\n\\\"Being little accustomed\ + \ to the practices of the savages, Alice, you\\nmistake the place of real danger,\\\ + \" said Heyward. \\\"If enemies have\\nreached the portage at all, a thing by\ + \ no means probable, as our scouts\\nare abroad, they will surely be found skirting\ + \ the column where scalps\\nabound the most. The route of the detachment is known,\ + \ while ours,\\nhaving been determined within the hour, must still be secret.\\\ + \"\\n\\n\\\"Should we distrust the man because his manners are not our manners,\ + \ and\\nthat his skin is dark?\\\" coldly asked Cora.\\n\\nAlice hesitated no\ + \ longer; but giving her Narragansett[4] a smart cut\\nof the whip, she was the\ + \ first to dash aside the slight branches of the\\nbushes, and to follow the runner\ + \ along the dark and tangled pathway. The\\nyoung man regarded the last speaker\ + \ in open admiration, and even\\npermitted her fairer though certainly not more\ + \ beautiful companion to\\nproceed unattended, while he sedulously opened the\ + \ way himself for the\\npassage of her who has been called Cora. It would seem\ + \ that the\\ndomestics had been previously instructed; for, instead of penetrating\\\ + nthe thicket, they followed the route of the column; a measure which\\nHeyward\ + \ stated had been dictated by the sagacity of their guide, in\\norder to diminish\ + \ the marks of their trail, if, haply, the Canadian\\nsavages should be lurking\ + \ so far in advance of their army. For many\\nminutes the intricacy of the route\ + \ admitted of no further dialogue;\\nafter which they emerged from the broad border\ + \ of underbrush which grew\\nalong the line of the highway, and entered under\ + \ the high but dark\\narches of the forest. Here their progress was less interrupted,\ + \ and the\\ninstant the guide perceived that the females could command their steeds,\\\ + nhe moved on, at a pace between a trot and a walk, and at a rate which\\nkept\ + \ the sure-footed and peculiar animals they rode, at a fast yet easy\\namble.\ + \ The youth had turned to speak to the dark-eyed Cora, when the\\ndistant sound\ + \ of horses' hoofs, clattering over the roots of the broken\\nway in his rear,\ + \ caused him to check his charger; and, as his companions\\ndrew their reins at\ + \ the same instant, the whole party came to a halt, in\\norder to obtain an explanation\ + \ of the unlooked-for interruption.\\n\\nIn a few moments a colt was seen gliding,\ + \ like a fallow-deer, among the\\nstraight trunks of the pines; and, in another\ + \ instant, the person of the\\nungainly man described in the preceding chapter,\ + \ came into view, with as\\nmuch rapidity as he could excite his meagre beast\ + \ to endure without\\ncoming to an open rupture. Until now this personage had\ + \ escaped the\\nobservation of the travellers. If he possessed the power to arrest\ + \ any\\nwandering eye when exhibiting the glories of his altitude on foot, his\\\ + nequestrian graces were still more likely to attract attention.\\nNotwithstanding\ + \ a constant application of his one armed heel to the\\nflanks of the mare, the\ + \ most confirmed gait that he could establish was\\na Canterbury gallop with the\ + \ hind legs, in which those more forward\\nassisted for doubtful moments, though\ + \ generally content to maintain a\\nloping trot. Perhaps the rapidity of the changes\ + \ from one of these paces\\nto the other created an optical illusion, which might\ + \ thus magnify the\\npowers of the beast; for it is certain that Heyward, who\ + \ possessed a\\ntrue eye for the merits of a horse, was unable, with his utmost\\\ + ningenuity, to decide by what sort of movement his pursuer worked his\\nsinuous\ + \ way on his footsteps with such persevering hardihood.\\n\\nThe industry and\ + \ movements of the rider were not less remarkable than\\nthose of the ridden.\ + \ At each change in the evolutions of the latter, the\\nformer raised his tall\ + \ person in the stirrups; producing, in this\\nmanner, by the undue elongation\ + \ of his legs, such sudden growths and\\ndiminishings of the stature, as baffled\ + \ every conjecture that might be\\nmade as to his dimensions. If to this be added\ + \ the fact that, in\\nconsequence of the ex parte application of the spur, one\ + \ side of the\\nmare appeared to journey faster than the other; and that the aggrieved\\\ + nflank was resolutely indicated by unremitted flourishes of a bushy tail,\\nwe\ + \ finish the picture of both horse and man.\\n\\nThe frown which had gathered\ + \ around the handsome, open, and manly brow\\nof Heyward, gradually relaxed, and\ + \ his lips curled into a slight smile,\\nas he regarded the stranger. Alice made\ + \ no very powerful effort to\\ncontrol her merriment; and even the dark, thoughtful\ + \ eye of Cora lighted\\nwith a humor that, it would seem, the habit, rather than\ + \ the nature of\\nits mistress repressed.\\n\\n\\\"Seek you any here?\\\" demanded\ + \ Heyward, when the other had arrived\\nsufficiently nigh to abate his speed;\ + \ \\\"I trust you are no messenger of\\nevil tidings?\\\"\\n\\n\\\"Even so,\\\"\ + \ replied the stranger, making diligent use of his triangular\\ncastor, to produce\ + \ a circulation in the close air of the woods, and\\nleaving his hearers in doubt\ + \ to which of the young man's questions he\\nresponded; when, however, he had\ + \ cooled his face, and recovered his\\nbreath, he continued, \\\"I hear you are\ + \ riding to William Henry; as I am\\njourneying thitherward myself, I concluded\ + \ good company would seem\\nconsistent to the wishes of both parties.\\\"\\n\\\ + n\\\"You appear to possess the privilege of a casting vote,\\\" returned\\nHeyward;\ + \ \\\"we are three, whilst you have consulted no one but yourself.\\\"\\n\\n\\\ + \"Even so. The first point to be obtained is to know one's own mind. Once\\nsure\ + \ of that, and where women are concerned, it is not easy, the next\\nis, to act\ + \ up to the decision. I have endeavored to do both, and here I\\nam.\\\"\\n\\\ + n\\\"If you journey to the lake, you have mistaken your route,\\\" said\\nHeyward,\ + \ haughtily; \\\"the highway thither is at least half a mile behind\\nyou.\\\"\ + \\n\\n\\\"Even so,\\\" returned the stranger, nothing daunted by this cold\\nreception;\ + \ \\\"I have tarried at 'Edward' a week, and I should be dumb not\\nto have inquired\ + \ the road I was to journey; and if dumb there would be\\nan end to my calling.\\\ + \" After simpering in a small way, like one whose\\nmodesty prohibited a more\ + \ open expression of his admiration of a\\nwitticism that was perfectly unintelligible\ + \ to his hearers, he\\ncontinued: \\\"It is not prudent for any one of my profession\ + \ to be too\\nfamiliar with those he is to instruct; for which reason I follow\ + \ not the\\nline of the army; besides which, I conclude that a gentleman of your\\\ + ncharacter has the best judgment in matters of wayfaring; I have\\ntherefore decided\ + \ to join company, in order that the ride may be made\\nagreeable, and partake\ + \ of social communion.\\\"\\n\\n\\\"A most arbitrary, if not a hasty decision!\\\ + \" exclaimed Heyward,\\nundecided whether to give vent to his growing anger, or\ + \ to laugh in the\\nother's face. \\\"But you speak of instruction, and of a profession;\ + \ are\\nyou an adjunct to the provincial corps, as a master of the noble science\\\ + nof defence and offence; or, perhaps, you are one who draws lines and\\nangles,\ + \ under the pretence of expounding the mathematics?\\\"\\n\\nThe stranger regarded\ + \ his interrogator a moment, in wonder; and then,\\nlosing every mark of self-satisfaction\ + \ in an expression of solemn\\nhumility, he answered:--\\n\\n\\\"Of offence, I\ + \ hope there is none, to either party: of defence, I make\\nnone--by God's good\ + \ mercy, having committed no palpable sin since last\\nentreating his pardoning\ + \ grace. I understand not your allusions about\\nlines and angles; and I leave\ + \ expounding to those who have been called\\nand set apart for that holy office.\ + \ I lay claim to no higher gift than a\\nsmall insight into the glorious art of\ + \ petitioning and thanksgiving, as\\npractised in psalmody.\\\"\\n\\n\\\"The man\ + \ is, most manifestly, a disciple of Apollo,\\\" cried the amused\\nAlice, \\\"\ + and I take him under my own especial protection. Nay, throw\\naside that frown,\ + \ Heyward, and in pity to my longing ears, suffer him to\\njourney in our train.\ + \ Besides,\\\" she added, in a low and hurried voice,\\ncasting a glance at the\ + \ distant Cora, who slowly followed the footsteps\\nof their silent but sullen\ + \ guide, \\\"it may be a friend added to our\\nstrength, in time of need.\\\"\\\ + n\\n\\\"Think you, Alice, that I would trust those I love by this secret path,\\\ + ndid I imagine such need could happen?\\\"\\n\\n\\\"Nay, nay, I think not of it\ + \ now; but this strange man amuses me; and if\\nhe 'hath music in his soul,' let\ + \ us not churlishly reject his company.\\\"\\nShe pointed persuasively along the\ + \ path with her riding-whip, while\\ntheir eyes met in a look which the young\ + \ man lingered a moment to\\nprolong; then yielding to her gentle influence, he\ + \ clapped his spurs\\ninto his charger, and in a few bounds was again at the side\ + \ of Cora.\\n\\n\\\"I am glad to encounter thee, friend,\\\" continued the maiden,\ + \ waving her\\nhand to the stranger to proceed, as she urged her Narragansett\ + \ to renew\\nits amble. \\\"Partial relatives have almost persuaded me that I\ + \ am not\\nentirely worthless in a duet myself; and we may enliven our wayfaring\ + \ by\\nindulging in our favorite pursuit. It might be of signal advantage to\\\ + none, ignorant as I, to hear the opinions and experience of a master in\\nthe\ + \ art.\\\"\\n\\n\\\"It is refreshing both to the spirits and to the body to indulge\ + \ in\\npsalmody, in befitting seasons,\\\" returned the master of song,\\nunhesitatingly\ + \ complying with her intimation to follow; \\\"and nothing\\nwould relieve the\ + \ mind more than such a consoling communion. But four\\nparts are altogether necessary\ + \ to the perfection of melody. You have all\\nthe manifestations of a soft and\ + \ rich treble; I can, by especial aid,\\ncarry a full tenor to the highest letter;\ + \ but we lack counter and bass!\\nYon officer of the king, who hesitated to admit\ + \ me to his company, might\\nfill the latter, if one may judge from the intonations\ + \ of his voice in\\ncommon dialogue.\\\"\\n\\n\\\"Judge not too rashly from hasty\ + \ and deceptive appearances,\\\" said the\\nlady, smiling; \\\"though Major Heyward\ + \ can assume such deep notes on\\noccasion, believe me, his natural tones are\ + \ better fitted for a mellow\\ntenor than the bass you heard.\\\"\\n\\n\\\"Is\ + \ he, then, much practised in the art of psalmody?\\\" demanded her\\nsimple companion.\\\ + n\\nAlice felt disposed to laugh, though she succeeded in suppressing her\\nmerriment,\ + \ ere she answered,--\\n\\n\\\"I apprehend that he is rather addicted to profane\ + \ song. The chances of\\na soldier's life are but little fitted for the encouragement\ + \ of more\\nsober inclinations.\\\"\\n\\n\\\"Man's voice is given to him, like\ + \ his other talents, to be used, and\\nnot to be abused. None can say they have\ + \ ever known me neglect my gifts!\\nI am thankful that, though my boyhood may\ + \ be said to have been set\\napart, like the youth of the royal David, for the\ + \ purposes of music, no\\nsyllable of rude verse has ever profaned my lips.\\\"\ + \\n\\n\\\"You have, then, limited your efforts to sacred song?\\\"\\n\\n\\\"Even\ + \ so. As the psalms of David exceed all other language, so does the\\npsalmody\ + \ that has been fitted to them by the divines and sages of the\\nland, surpass\ + \ all vain poetry. Happily, I may say that I utter nothing\\nbut the thoughts\ + \ and the wishes of the King of Israel himself; for\\nthough the times may call\ + \ for some slight changes, yet does this version\\nwhich we use in the colonies\ + \ of New England, so much exceed all other\\nversions, that, by its richness,\ + \ its exactness, and its spiritual\\nsimplicity, it approacheth, as near as may\ + \ be, to the great work of the\\ninspired writer. I never abide in any place,\ + \ sleeping or waking, without\\nan example of this gifted work. 'Tis the six-and-twentieth\ + \ edition,\\npromulgated at Boston, Anno Domini 1744; and is entitled, _The Psalms,\\\ + nHymns, and Spiritual Songs of the Old and New Testaments; faithfully\\ntranslated\ + \ into English Metre, for the Use, Edification, and Comfort of\\nthe Saints, in\ + \ Public and Private, especially in New England_.\\\"\\n\\nDuring this eulogium\ + \ on the rare production of his native poets, the\\nstranger had drawn the book\ + \ from his pocket, and, fitting a pair of\\niron-rimmed spectacles to his nose,\ + \ opened the volume with a care and\\nveneration suited to its sacred purposes.\ + \ Then, without circumlocution\\nor apology, first pronouncing the word \\\"Standish,\\\ + \" and placing the\\nunknown engine, already described, to his mouth, from which\ + \ he drew a\\nhigh, shrill sound, that was followed by an octave below, from his\ + \ own\\nvoice, he commenced singing the following words, in full, sweet, and\\\ + nmelodious tones, that set the music, the poetry, and even the uneasy\\nmotion\ + \ of his ill-trained beast at defiance:--\\n\\n \\\"How good it is, O see,\\\ + n And how it pleaseth well,\\n Together, e'en in unity,\\n For brethren\ + \ so to dwell.\\n It's like the choice ointment,\\n From the head to the beard\ + \ did go:\\n Down Aaron's beard, that downward went,\\n His garment's skirts\ + \ unto.\\\"\\n\\nThe delivery of these skilful rhymes was accompanied, on the\ + \ part of the\\nstranger, by a regular rise and fall of his right hand, which\\\ + nterminated at the descent, by suffering the fingers to dwell a moment on\\nthe\ + \ leaves of the little volume; and on the ascent, by such a flourish\\nof the\ + \ member as none but the initiated may ever hope to imitate. It\\nwould seem that\ + \ long practice had rendered this manual accompaniment\\nnecessary; for it did\ + \ not cease until the preposition which the poet had\\nselected for the close\ + \ of his verse, had been duly delivered like a word\\nof two syllables.\\n\\nSuch\ + \ an innovation on the silence and retirement of the forest could not\\nfail to\ + \ enlist the ears of those who journeyed at so short a distance in\\nadvance.\ + \ The Indian muttered a few words in broken English to Heyward,\\nwho, in his\ + \ turn, spoke to the stranger; at once interrupting, and, for\\nthe time, closing\ + \ his musical efforts.\\n\\n\\\"Though we are not in danger, common prudence would\ + \ teach us to journey\\nthrough this wilderness in as quiet a manner as possible.\ + \ You will,\\nthen, pardon me, Alice, should I diminish your enjoyments, by requesting\\\ + nthis gentleman to postpone his chant until a safer opportunity.\\\"\\n\\n\\\"\ + You will diminish them, indeed,\\\" returned the arch girl, \\\"for never did\\\ + nI hear a more unworthy conjunction of execution and language, than that\\nto\ + \ which I have been listening; and I was far gone in a learned inquiry\\ninto\ + \ the causes of such an unfitness between sound and sense, when you\\nbroke the\ + \ charm of my musings by that bass of yours, Duncan!\\\"\\n\\n\\\"I know not what\ + \ you call my bass,\\\" said Heyward, piqued at her remark,\\n\\\"but I know that\ + \ your safety, and that of Cora, is far dearer to me than\\ncould be any orchestra\ + \ of Handel's music.\\\" He paused and turned his head\\nquickly towards a thicket,\ + \ and then bent his eyes suspiciously on their\\nguide, who continued his steady\ + \ pace, in undisturbed gravity. The young\\nman smiled to himself, for he believed\ + \ he had mistaken some shining\\nberry of the woods for the glistening eyeballs\ + \ of a prowling savage, and\\nhe rode forward, continuing the conversation which\ + \ had been interrupted\\nby the passing thought.\\n\\nMajor Heyward was mistaken\ + \ only in suffering his youthful and generous\\npride to suppress his active watchfulness.\ + \ The cavalcade had not long\\npassed, before the branches of the bushes that\ + \ formed the thicket were\\ncautiously moved asunder, and a human visage, as fiercely\ + \ wild as savage\\nart and unbridled passions could make it, peered out on the\ + \ retiring\\nfootsteps of the travellers. A gleam of exultation shot across the\\\ + ndarkly painted lineaments of the inhabitant of the forest, as he traced\\nthe\ + \ route of his intended victims, who rode unconsciously onward; the\\nlight and\ + \ graceful forms of the females waving among the trees, in the\\ncurvatures of\ + \ their path, followed at each bend by the manly figure of\\nHeyward, until, finally,\ + \ the shapeless person of the singing-master was\\nconcealed behind the numberless\ + \ trunks of trees, that rose, in dark\\nlines, in the intermediate space.\\n\\\ + n\\n\\n\",\n \"output\": \"Before any characters appear, the time and geography\ + \ are made clear. Though it is the last war that England and France waged for\ + \ a country that neither would retain, the wilderness between the forces still\ + \ has to be overcome first. Thus it is in 1757, in the New York area between the\ + \ head waters of the Hudson River and Lake George to the north. Because only two\ + \ years earlier General Braddock was disgracefully routed by a handful of French\ + \ and Indians, the frontier is now exposed to real and imaginary savage disasters\ + \ as well as to the horrors of warfare. Fear has replaced reason. Near dusk of\ + \ a day in July, an Indian runner named Magua arrives at Fort Edward on the upper\ + \ Hudson. He has come from Fort William Henry at the southern tip of Lake George\ + \ with the news that the French General Montcalm is moving south with a very large\ + \ army and that Munro, commander of Fort William Henry, is in urgent need of plentiful\ + \ reinforcements from General Webb. Early the next morning, a limited detachment\ + \ of fifteen hundred regulars and colonists departs as if swallowed by the forest.\ + \ Shortly afterwards, Major Duncan Heyward and Alice and Cora Munro, guided by\ + \ Magua on foot, take by horseback a secret route toward William Henry for the\ + \ girls to join their father. Blonde Alice is doubtful about Magua, covered with\ + \ war paint and showing a sullen fierceness; but dark-haired Cora is stoically\ + \ common sense about him, even though Heyward mentions that their father had once\ + \ had to deal rigidly with the Indian. As the small party pushes on, they are\ + \ overtaken by David Gamut, a tall, ungainly psalmodist ridiculously dressed and\ + \ carrying a pitch pipe while riding a mare followed by its young colt. He desires\ + \ to join them, and after some banter between him and Alice, he pulls out the\ + \ twenty-sixth edition of The Bay Psalm Book, sounds his pipe, and renders a song\ + \ \\\"in full, sweet, and melodious tones.\\\" At a muttered comment from Magua,\ + \ Heyward insists upon silence for safety. Then he glances about them and, satisfied\ + \ that he has seen only shining berries, smiles to himself as they move on. But\ + \ he is wrong. The branches move and a man peers exultingly after them as they\ + \ disappear among the dark lines of trees.\"\n },\n \"truncated_cells\"\ + : []\n },\n {\n \"row_idx\": 1,\n \"row\": {\n \"instruction\": \"\ + Please complete the task of abstracting and extracting text content from different\ + \ domains, where input is the content of the article and output is the result\ + \ of the summary.\",\n \"input\": \"\\n \\\"Before these fields were shorn\ + \ and tilled,\\n Full to the brim our rivers flowed;\\n The melody of waters\ + \ filled\\n The fresh and boundless wood;\\n And torrents dashed, and rivulets\ + \ played,\\n And fountains spouted in the shade.\\\"\\n\\n BRYANT.\\n\\n\\\ + nLeaving the unsuspecting Heyward and his confiding companions to\\npenetrate\ + \ still deeper into a forest that contained such treacherous\\ninmates, we must\ + \ use an author's privilege, and shift the scene a few\\nmiles to the westward\ + \ of the place where we have last seen them.\\n\\nOn that day, two men were lingering\ + \ on the banks of a small but rapid\\nstream, within an hour's journey of the\ + \ encampment of Webb, like those\\nwho awaited the appearance of an absent person,\ + \ or the approach of some\\nexpected event. The vast canopy of woods spread itself\ + \ to the margin of\\nthe river overhanging the water, and shadowing its dark current\ + \ with a\\ndeeper hue. The rays of the sun were beginning to grow less fierce,\ + \ and\\nthe intense heat of the day was lessened, as the cooler vapors of the\\\ + nsprings and fountains rose above their leafy beds, and rested in the\\natmosphere.\ + \ Still that breathing silence, which marks the drowsy\\nsultriness of an American\ + \ landscape in July, pervaded the secluded spot,\\ninterrupted only by the low\ + \ voices of the men, the occasional and lazy\\ntap of a woodpecker, the discordant\ + \ cry of some gaudy jay, or a swelling\\non the ear, from the dull roar of a distant\ + \ waterfall.\\n\\nThese feeble and broken sounds were, however, too familiar to\ + \ the\\nforesters, to draw their attention from the more interesting matter of\\\ + ntheir dialogue. While one of these loiterers showed the red skin and\\nwild accoutrements\ + \ of a native of the woods, the other exhibited,\\nthrough the mask of his rude\ + \ and nearly savage equipments, the brighter,\\nthough sunburnt and long-faded\ + \ complexion of one who might claim descent\\nfrom a European parentage. The former\ + \ was seated on the end of a mossy\\nlog, in a posture that permitted him to heighten\ + \ the effect of his\\nearnest language, by the calm but expressive gestures of\ + \ an Indian\\nengaged in debate. His body, which was nearly naked, presented a\\\ + nterrific emblem of death, drawn in intermingled colors of white and\\nblack.\ + \ His closely shaved head, on which no other hair than the well\\nknown and chivalrous\ + \ scalping tuft[5] was preserved, was without\\nornament of any kind, with the\ + \ exception of a solitary eagle's plume,\\nthat crossed his crown, and depended\ + \ over the left shoulder. A tomahawk\\nand scalping-knife, of English manufacture,\ + \ were in his girdle; while a\\nshort military rifle, of that sort with which\ + \ the policy of the whites\\narmed their savage allies, lay carelessly across\ + \ his bare and sinewy\\nknee. The expanded chest, full formed limbs, and grave\ + \ countenance of\\nthis warrior, would denote that he had reached the vigor of\ + \ his days,\\nthough no symptoms of decay appeared to have yet weakened his manhood.\\\ + n\\nThe frame of the white man, judging by such parts as were not concealed\\\ + nby his clothes, was like that of one who had known hardships and\\nexertion from\ + \ his earliest youth. His person, though muscular, was\\nrather attenuated than\ + \ full; but every nerve and muscle appeared strung\\nand indurated by unremitted\ + \ exposure and toil. He wore a hunting-shirt\\nof forest green, fringed with faded\ + \ yellow[6], and a summer cap of skins\\nwhich had been shorn of their fur. He\ + \ also bore a knife in a girdle of\\nwampum, like that which confined the scanty\ + \ garments of the Indian, but\\nno tomahawk. His moccasins were ornamented after\ + \ the gay fashion of the\\nnatives, while the only part of his under-dress which\ + \ appeared below the\\nhunting-frock, was a pair of buckskin leggings, that laced\ + \ at the sides,\\nand which were gartered above the knees with the sinews of a\ + \ deer. A\\npouch and horn completed his personal accoutrements, though a rifle\ + \ of\\ngreat length[7], which the theory of the more ingenious whites had\\ntaught\ + \ them was the most dangerous of all fire-arms, leaned against a\\nneighboring\ + \ sapling. The eye of the hunter, or scout, whichever he might\\nbe, was small,\ + \ quick, keen, and restless, roving while he spoke, on\\nevery side of him, as\ + \ if in quest of game, or distrusting the sudden\\napproach of some lurking enemy.\ + \ Notwithstanding the symptoms of habitual\\nsuspicion, his countenance was not\ + \ only without guile, but at the moment\\nat which he is introduced, it was charged\ + \ with an expression of sturdy\\nhonesty.\\n\\n\\\"Even your traditions make the\ + \ case in my favor, Chingachgook,\\\" he said,\\nspeaking in the tongue which\ + \ was known to all the natives who formerly\\ninhabited the country between the\ + \ Hudson and the Potomac, and of which\\nwe shall give a free translation for\ + \ the benefit of the reader;\\nendeavoring, at the same time, to preserve some\ + \ of the peculiarities,\\nboth of the individual and of the language. \\\"Your\ + \ fathers came from the\\nsetting sun, crossed the big river,[8] fought the people\ + \ of the country,\\nand took the land; and mine came from the red sky of the morning,\ + \ over\\nthe salt lake, and did their work much after the fashion that had been\\\ + nset them by yours; then let God judge the matter between us, and friends\\nspare\ + \ their words!\\\"\\n\\n\\\"My fathers fought with the naked redmen!\\\" returned\ + \ the Indian sternly,\\nin the same language. \\\"Is there no difference, Hawkeye,\ + \ between the\\nstone-headed arrow of the warrior, and the leaden bullet with\ + \ which you\\nkill?\\\"\\n\\n\\\"There is reason in an Indian, though nature has\ + \ made him with a red\\nskin!\\\" said the white man, shaking his head like one\ + \ on whom such an\\nappeal to his justice was not thrown away. For a moment he\ + \ appeared to\\nbe conscious of having the worst of the argument, then, rallying\ + \ again,\\nhe answered the objection of his antagonist in the best manner his\\\ + nlimited information would allow: \\\"I am no scholar, and I care not who\\nknows\ + \ it; but judging from what I have seen, at deer chases and squirrel\\nhunts,\ + \ of the sparks below, I should think a rifle in the hands of their\\ngrandfathers\ + \ was not so dangerous as a hickory bow and a good flint-head\\nmight be, if drawn\ + \ with Indian judgment, and sent by an Indian eye.\\\"\\n\\n\\\"You have the story\ + \ told by your fathers,\\\" returned the other, coldly\\nwaving his hand. \\\"\ + What say your old men? do they tell the young\\nwarriors, that the pale-faces\ + \ met the redmen, painted for war and armed\\nwith the stone hatchet and wooden\ + \ gun?\\\"\\n\\n\\\"I am not a prejudiced man, nor one who vaunts himself on his\ + \ natural\\nprivileges, though the worst enemy I have on earth, and he is an\\\ + nIroquois, daren't deny that I am genuine white,\\\" the scout replied,\\nsurveying,\ + \ with secret satisfaction, the faded color of his bony and\\nsinewy hand; \\\"\ + and I am willing to own that my people have many ways, of\\nwhich, as an honest\ + \ man, I can't approve. It is one of their customs to\\nwrite in books what they\ + \ have done and seen, instead of telling them in\\ntheir villages, where the lie\ + \ can be given to the face of a cowardly\\nboaster, and the brave soldier can\ + \ call on his comrades to witness for\\nthe truth of his words. In consequence\ + \ of this bad fashion, a man who is\\ntoo conscientious to misspend his days among\ + \ the women, in learning the\\nnames of black marks, may never hear of the deeds\ + \ of his fathers, nor\\nfeel a pride in striving to outdo them. For myself, I\ + \ conclude the\\nBumppos could shoot, for I have a natural turn with a rifle,\ + \ which must\\nhave been handed down from generation to generation, as, our holy\\\ + ncommandments tell us, all good and evil gifts are bestowed; though I\\nshould\ + \ be loth to answer for other people in such a matter. But every\\nstory has its\ + \ two sides; so I ask you, Chingachgook, what passed,\\naccording to the traditions\ + \ of the redmen, when our fathers first met?\\\"\\n\\nA silence of a minute succeeded,\ + \ during which the Indian sat mute; then,\\nfull of the dignity of his office,\ + \ he commenced his brief tale, with a\\nsolemnity that served to heighten its\ + \ appearance of truth.\\n\\n\\\"Listen, Hawkeye, and your ear shall drink no lie.\ + \ 'Tis what my fathers\\nhave said, and what the Mohicans have done.\\\" He hesitated\ + \ a single\\ninstant, and bending a cautious glance toward his companion, he\\\ + ncontinued, in a manner that was divided between interrogation and\\nassertion,\ + \ \\\"Does not this stream at our feet run towards the summer,\\nuntil its waters\ + \ grow salt, and the current flows upward?\\\"\\n\\n\\\"It can't be denied that\ + \ your traditions tell you true in both these\\nmatters,\\\" said the white man;\ + \ \\\"for I have been there, and have seen\\nthem; though, why water, which is\ + \ so sweet in the shade, should become\\nbitter in the sun, is an alteration for\ + \ which I have never been able to\\naccount.\\\"\\n\\n\\\"And the current!\\\"\ + \ demanded the Indian, who expected his reply with that\\nsort of interest that\ + \ a man feels in the confirmation of testimony, at\\nwhich he marvels even while\ + \ he respects it; \\\"the fathers of Chingachgook\\nhave not lied!\\\"\\n\\n\\\ + \"The Holy Bible is not more true, and that is the truest thing in\\nnature. They\ + \ call this up-stream current the tide, which is a thing soon\\nexplained, and\ + \ clear enough. Six hours the waters run in, and six hours\\nthey run out, and\ + \ the reason is this: when there is higher water in the\\nsea than in the river,\ + \ they run in, until the river gets to be highest,\\nand then it runs out again.\\\ + \"\\n\\n\\\"The waters in the woods, and on the great lakes, run downward until\\\ + nthey lie like my hand,\\\" said the Indian, stretching the limb\\nhorizontally\ + \ before him, \\\"and then they run no more.\\\"\\n\\n\\\"No honest man will deny\ + \ it,\\\" said the scout, a little nettled at the\\nimplied distrust of his explanation\ + \ of the mystery of the tides; \\\"and I\\ngrant that it is true on the small\ + \ scale, and where the land is level.\\nBut everything depends on what scale you\ + \ look at things. Now, on the\\nsmall scale, the 'arth is level; but on the large\ + \ scale it is round. In\\nthis manner, pools and ponds, and even the great fresh-water\ + \ lake, may\\nbe stagnant, as you and I both know they are, having seen them;\ + \ but when\\nyou come to spread water over a great tract, like the sea, where\ + \ the\\nearth is round, how in reason can the water be quiet? You might as well\\\ + nexpect the river to lie still on the brink of those black rocks a mile\\nabove\ + \ us, though your own ears tell you that it is tumbling over them at\\nthis very\ + \ moment!\\\"\\n\\nIf unsatisfied by the philosophy of his companion, the Indian\ + \ was far\\ntoo dignified to betray his unbelief. He listened like one who was\\\ + nconvinced, and resumed his narrative in his former solemn manner.\\n\\n\\\"We\ + \ came from the place where the sun is hid at night, over great plains\\nwhere\ + \ the buffaloes live, until we reached the big river. There we\\nfought the Alligewi,\ + \ till the ground was red with their blood. From the\\nbanks of the big river\ + \ to the shores of the salt lake, there was none to\\nmeet us. The Maquas followed\ + \ at a distance. We said the country should\\nbe ours from the place where the\ + \ water runs up no longer on this stream,\\nto a river twenty suns' journey toward\ + \ the summer. The land we had taken\\nlike warriors, we kept like men. We drove\ + \ the Maquas into the woods with\\nthe bears. They only tasted salt at the licks;\ + \ they drew no fish from\\nthe great lake; we threw them the bones.\\\"\\n\\n\\\ + \"All this I have heard and believe,\\\" said the white man, observing that\\\ + nthe Indian paused: \\\"but it was long before the English came into the\\ncountry.\\\ + \"\\n\\n\\\"A pine grew then where this chestnut now stands. The first pale-faces\\\ + nwho came among us spoke no English. They came in a large canoe, when my\\nfathers\ + \ had buried the tomahawk with the redmen around them. Then,\\nHawkeye,\\\" he\ + \ continued, betraying his deep emotion only by permitting\\nhis voice to fall\ + \ to those low, guttural tones, which rendered his\\nlanguage, as spoken at times,\ + \ so very musical; \\\"then, Hawkeye, we were\\none people, and we were happy.\ + \ The salt lake gave us its fish, the wood\\nits deer, and the air its birds.\ + \ We took wives who bore us children; we\\nworshipped the Great Spirit; and we\ + \ kept the Maquas beyond the sound of\\nour songs of triumph!\\\"\\n\\n\\\"Know\ + \ you anything of your own family at that time?\\\" demanded the white.\\n\\\"\ + But you are a just man, for an Indian! and, as I suppose you hold their\\ngifts,\ + \ your fathers must have been brave warriors, and wise men at the\\ncouncil fire.\\\ + \"\\n\\n\\\"My tribe is the grandfather of nations, but I am an unmixed man. The\\\ + nblood of chiefs is in my veins, where it must stay forever. The Dutch\\nlanded,\ + \ and gave my people the fire-water; they drank until the heavens\\nand the earth\ + \ seemed to meet, and they foolishly thought they had found\\nthe Great Spirit.\ + \ Then they parted with their land. Foot by foot, they\\nwere driven back from\ + \ the shores, until I, that am a chief and a\\nsagamore, have never seen the sun\ + \ shine but through the trees, and have\\nnever visited the graves of, my fathers!\\\ + \"\\n\\n\\\"Graves bring solemn feelings over the mind,\\\" returned the scout,\ + \ a good\\ndeal touched at the calm suffering of his companion; \\\"and they often\ + \ aid\\na man in his good intentions; though, for myself, I expect to leave my\\\ + nown bones unburied, to bleach in the woods, or to be torn asunder by the\\nwolves.\ + \ But where are to be found those of your race who came to their\\nkin in the\ + \ Delaware country, so many summers since?\\\"\\n\\n\\\"Where are the blossoms\ + \ of those summers!--fallen, one by one: so all of\\nmy family departed, each\ + \ in his turn, to the land of spirits. I am on\\nthe hill-top, and must go down\ + \ into the valley; and when Uncas follows\\nin my footsteps, there will no longer\ + \ be any of the blood of the\\nsagamores, for my boy is the last of the Mohicans.\\\ + \"\\n\\n\\\"Uncas is here!\\\" said another voice, in the same soft, guttural\ + \ tones,\\nnear his elbow; \\\"who speaks to Uncas?\\\"\\n\\nThe white man loosened\ + \ his knife in his leathern sheath, and made an\\ninvoluntary movement of the\ + \ hand towards his rifle, at this sudden\\ninterruption; but the Indian sat composed,\ + \ and without turning his head\\nat the unexpected sounds.\\n\\nAt the next instant,\ + \ a youthful warrior passed between them, with a\\nnoiseless step, and seated\ + \ himself on the bank of the rapid stream. No\\nexclamation of surprise escaped\ + \ the father, nor was any question asked,\\nor reply given, for several minutes;\ + \ each appearing to await the moment\\nwhen he might speak, without betraying\ + \ womanish curiosity or childish\\nimpatience. The white man seemed to take counsel\ + \ from their customs,\\nand, relinquishing his grasp of the rifle, he also remained\ + \ silent and\\nreserved. At length Chingachgook turned his eyes slowly towards\ + \ his son,\\nand demanded,--\\n\\n\\\"Do the Maquas dare to leave the print of\ + \ their moccasins in these\\nwoods?\\\"\\n\\n\\\"I have been on their trail,\\\ + \" replied the young Indian, \\\"and know that\\nthey number as many as the fingers\ + \ of my two hands; but they lie hid,\\nlike cowards.\\\"\\n\\n\\\"The thieves\ + \ are outlying for scalps and plunder!\\\" said the white man,\\nwhom we shall\ + \ call Hawkeye, after the manner of his companions. \\\"That\\nbushy Frenchman,\ + \ Montcalm, will send his spies into our very camp, but\\nhe will know what road\ + \ we travel!\\\"\\n\\n\\\"Tis enough!\\\" returned the father, glancing his eye\ + \ towards the setting\\nsun; \\\"they shall be driven like deer from their bushes.\ + \ Hawkeye, let us\\neat to-night, and show the Maquas that we are men to-morrow.\\\ + \"\\n\\n\\\"I am as ready to do the one as the other; but to fight the Iroquois\\\ + n'tis necessary to find the skulkers; and to eat, 'tis necessary to get\\nthe\ + \ game--talk of the devil and he will come; there is a pair of the\\nbiggest antlers\ + \ I have seen this season, moving the bushes below the\\nhill! Now, Uncas,\\\"\ + \ he continued in a half whisper, and laughing with a\\nkind of inward sound,\ + \ like one who had learnt to be watchful, \\\"I will\\nbet my charger three times\ + \ full of powder, against a foot of wampum,\\nthat I take him atwixt the eyes,\ + \ and nearer to the right than to the\\nleft.\\\"\\n\\n\\\"It cannot be!\\\" said\ + \ the young Indian, springing to his feet with\\nyouthful eagerness; \\\"all but\ + \ the tips of his horns are hid!\\\"\\n\\n\\\"He's a boy!\\\" said the white man,\ + \ shaking his head while he spoke, and\\naddressing the father. \\\"Does he think\ + \ when a hunter sees a part of the\\ncreatur', he can't tell where the rest of\ + \ him should be!\\\"\\n\\n[Illustration: _Copyright by Charles Scribner's Sons_\\\ + n\\nUNCAS SLAYS A DEER\\n\\n_Avoiding the horns of the infuriated animal, Uncas\ + \ darted to his side,\\nand passed his knife across the throat_]\\n\\nAdjusting\ + \ his rifle, he was about to make an exhibition of that skill,\\non which he so\ + \ much valued himself, when the warrior struck up the piece\\nwith his hand, saying--\\\ + n\\n\\\"Hawkeye! will you fight the Maquas?\\\"\\n\\n\\\"These Indians know the\ + \ nature of the woods, as it might be by\\ninstinct!\\\" returned the scout, dropping\ + \ his rifle, and turning away like\\na man who was convinced of his error. \\\"\ + I must leave the buck to your\\narrow, Uncas, or we may kill a deer for them thieves,\ + \ the Iroquois, to\\neat.\\\"\\n\\nThe instant the father seconded this intimation\ + \ by an expressive gesture\\nof the hand, Uncas threw himself on the ground, and\ + \ approached the\\nanimal with wary movements. When within a few yards of the\ + \ cover, he\\nfitted an arrow to his bow with the utmost care, while the antlers\\\ + nmoved, as if their owner snuffed an enemy in the tainted air. In another\\nmoment\ + \ the twang of the cord was heard, a white streak was seen glancing\\ninto the\ + \ bushes, and the wounded buck plunged from the cover, to the\\nvery feet of his\ + \ hidden enemy. Avoiding the horns of the infuriated\\nanimal, Uncas darted to\ + \ his side, and passed his knife across the\\nthroat, when bounding to the edge\ + \ of the river it fell, dyeing the\\nwaters with its blood.\\n\\n\\\"'Twas done\ + \ with Indian skill,\\\" said the scout, laughing inwardly, but\\nwith vast satisfaction;\ + \ \\\"and 'twas a pretty sight to behold! Though an\\narrow is a near shot, and\ + \ needs a knife to finish the work.\\\"\\n\\n\\\"Hugh!\\\" ejaculated his companion,\ + \ turning quickly, like a hound who\\nscented game.\\n\\n\\\"By the Lord, there\ + \ is a drove of them!\\\" exclaimed the scout, whose eyes\\nbegan to glisten with\ + \ the ardor of his usual occupation; \\\"if they come\\nwithin range of a bullet\ + \ I will drop one, though the whole Six Nations\\nshould be lurking within sound!\ + \ What do you hear, Chingachgook? for to\\nmy ears the woods are dumb.\\\"\\n\\\ + n\\\"There is but one deer, and he is dead,\\\" said the Indian, bending his\\\ + nbody till his ear nearly touched the earth. \\\"I hear the sounds of feet!\\\"\ + \\n\\n\\\"Perhaps the wolves have driven the buck to shelter, and are following\\\ + non his trail.\\\"\\n\\n\\\"No. The horses of white men are coming!\\\" returned\ + \ the other, raising\\nhimself with dignity, and resuming his seat on the log\ + \ with his former\\ncomposure. \\\"Hawkeye, they are your brothers; speak to them.\\\ + \"\\n\\n\\\"That will I, and in English that the king needn't be ashamed to\\\ + nanswer,\\\" returned the hunter, speaking in the language of which he\\nboasted;\ + \ \\\"but I see nothing, nor do I hear the sounds of man or beast;\\n'tis strange\ + \ that an Indian should understand white sounds better than a\\nman who, his very\ + \ enemies will own, has no cross in his blood, although\\nhe may have lived with\ + \ the redskins long enough to be suspected! Ha!\\nthere goes something like the\ + \ cracking of a dry stick, too--now I hear\\nthe bushes move--yes, yes, there\ + \ is a trampling that I mistook for the\\nfalls--and--but here they come themselves;\ + \ God keep them from the\\nIroquois!\\\"\\n\\n\\n\\n\",\n \"output\": \"\ + In another part of the forest by the river a few miles to the west, Hawkeye and\ + \ Chingachgook appear to be waiting for someone as they talk with low voices.\ + \ It is now afternoon. The Indian and the scout are attired according to their\ + \ forest habits: Chingachgook with his semi-nude, war-painted body and scalping\ + \ tuft of hair, his tomahawk, scalping knife, and short rifle; Hawkeye with his\ + \ hunting shirt, skin cap, buckskin leggings, knife, pouch and horn, and long\ + \ rifle. They discuss their respective forefathers, and Chingachgook relates the\ + \ slow demise of his tribe of Mohicans so that only he and his son Uncas now remain.\ + \ At the mention of his name, Uncas, a youthful warrior dressed much like Hawkeye,\ + \ appears and says that he has been on the trail of the Maquas, another name for\ + \ the Mengwe or Iroquois, their natural enemies. The antlers of a deer are seen\ + \ in the distance, and Hawkeye is about to shoot the animal for food when the\ + \ warrior warns him that a shot will warn the enemy. Just as Uncas kills it with\ + \ an arrow, they hear the sounds of feet which Chingachgook recognizes as the\ + \ horses of white men.\"\n },\n \"truncated_cells\": []\n }\n]" + - "HUB_DATASET_PREVIEW: DATASET_NAME: \"tner/conll2003\"\nFEATURES: {'tokens': {'feature':\ + \ {'dtype': 'string', '_type': 'Value'}, '_type': 'Sequence'}, 'tags': {'feature':\ + \ {'dtype': 'int32', '_type': 'Value'}, '_type': 'Sequence'}}\nDATA SAMPLE:\n\ + [\n {\n \"row_idx\": 0,\n \"row\": {\n \"tokens\": [\n \"EU\"\ + ,\n \"rejects\",\n \"German\",\n \"call\",\n \"to\"\ + ,\n \"boycott\",\n \"British\",\n \"lamb\",\n \".\"\ + \n ],\n \"tags\": [\n 1,\n 0,\n 2,\n 0,\n\ + \ 0,\n 0,\n 2,\n 0,\n 0\n ]\n },\n\ + \ \"truncated_cells\": []\n },\n {\n \"row_idx\": 1,\n \"row\": {\n\ + \ \"tokens\": [\n \"Peter\",\n \"Blackburn\"\n ],\n \ + \ \"tags\": [\n 3,\n 4\n ]\n },\n \"truncated_cells\"\ + : []\n }\n]" +- source_sentence: 'USER_QUERY: text to sql dataset' + sentences: + - "HUB_DATASET_PREVIEW: DATASET_NAME: \"InfiniFlow/text2sql\"\nFEATURES: {'text':\ + \ {'dtype': 'string', '_type': 'Value'}}\nDATA SAMPLE:\n[\n {\n \"row_idx\"\ + : 0,\n \"row\": {\n \"text\": \"\"\n },\n \"truncated_cells\": []\n\ + \ },\n {\n \"row_idx\": 1,\n \"row\": {\n \"text\": \"### \\u7528\\\ + u6237\\u8868\\uff08users\\uff09\"\n },\n \"truncated_cells\": []\n }\n]" + - "HUB_DATASET_PREVIEW: DATASET_NAME: \"andresgtn/celeb-identities\"\nFEATURES:\ + \ {'image': {'_type': 'Image'}, 'label': {'names': ['Brad_Pitt', 'Donald_Trump',\ + \ 'Emma_Stone', 'Jessica_Alba', 'Johnny_Depp', 'Julia_Roberts'], '_type': 'ClassLabel'}}\n\ + DATA SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\": {\n \"image\": {\n\ + \ \"src\": \"https://datasets-server.huggingface.co/assets/andresgtn/celeb-identities/--/default/train/0/image/image.jpg?Expires=1726591617&Signature=q--3qiojJDrvuSWWGLgJWrS4c6npTB56vauiMf7TqH8cYxEoUIBTGWHpn2d38sz9duxhXFgmmlkGIk042lszNSMnkVK9Y3vQJI9-FhXjRpZzWlS-PY-e-7ly7fssmssEy0NSinNQ-z8hg2fhs1T1N1iHH9-vyr1B1QWqJYVcBw7ccoDAEr6nlTzQeHKqyEoXEachNGgABSHqIErpBz4aaCP-af~jUAiojqDluy55H4d8mZ6xfs9dgt5WCLTJ0mDkSRiHdDKQ4RA12R6JAk0zOj19Ldhp6wXXrf9jFuSvzKEU7ElwE8qN~MSak9sTM81ngizmx42Y~Fgx270MlRQPLQ__&Key-Pair-Id=K3EI6M078Z3AC3\"\ + ,\n \"height\": 240,\n \"width\": 165\n },\n \"label\"\ + : 0\n },\n \"truncated_cells\": []\n },\n {\n \"row_idx\": 1,\n \ + \ \"row\": {\n \"image\": {\n \"src\": \"https://datasets-server.huggingface.co/assets/andresgtn/celeb-identities/--/default/train/1/image/image.jpg?Expires=1726591617&Signature=sJR70D9XoXJtYCKDloI2SvXrMHeapO5og240B4WuNMO8Mr-q3-9ZunPQX22-fa0QkVRdy9R4NQoAto34KGwJGfn3sDZL-YBQboROs1OMwuYBhtNh1~1SgBKKhuhww-QQce9Z7DD4MwGy8j1HCdLOJmkvFiBbd-B~w6kdTOBbekCJPJmrr1zGz~cXkg7zzpnKpBcScK8XA0Y9ESNkKVl~4Q~RTl839vo93NqlKoWW2gmVCM0d5BFn3~mZm9HHWj1bOPerssRcYLSwwC1iOB5fmK-Y6e~fRWMnrnq94N3O20S-uYher6Q7wtssANteZGCKIJVBULAb3oRU0o~NN1UhsQ__&Key-Pair-Id=K3EI6M078Z3AC3\"\ + ,\n \"height\": 240,\n \"width\": 165\n },\n \"label\"\ + : 0\n },\n \"truncated_cells\": []\n }\n]" + - "NEGATIVE: DATASET_NAME: \"lamini/spider_text_to_sql\"\nFEATURES: {'input': {'dtype':\ + \ 'string', '_type': 'Value'}, 'output': {'dtype': 'string', '_type': 'Value'}}\n\ + DATA SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\": {\n \"input\": \"\ + [INST] Here is a database schema:\\ndepartment :\\nDepartment_ID [ INT ] primary_key\\\ + nName [ TEXT ]\\nCreation [ TEXT ]\\nRanking [ INT ]\\nBudget_in_Billions [ INT\ + \ ]\\nNum_Employees [ INT ]\\n\\nhead :\\nhead_ID [ INT ] primary_key\\nname [\ + \ TEXT ]\\nborn_state [ TEXT ]\\nage [ INT ]\\n\\nmanagement :\\ndepartment_ID\ + \ [ INT ] primary_key management.department_ID = department.Department_ID\\nhead_ID\ + \ [ INT ] management.head_ID = head.head_ID\\ntemporary_acting [ TEXT ]\\n\\nPlease\ + \ write me a SQL statement that answers the following question: How many heads\ + \ of the departments are older than 56 ? [/INST]\",\n \"output\": \"SELECT\ + \ count(*) FROM head WHERE age > 56;\"\n },\n \"truncated_cells\": []\n\ + \ },\n {\n \"row_idx\": 1,\n \"row\": {\n \"input\": \"[INST] Here\ + \ is a database schema:\\ndepartment :\\nDepartment_ID [ INT ] primary_key\\nName\ + \ [ TEXT ]\\nCreation [ TEXT ]\\nRanking [ INT ]\\nBudget_in_Billions [ INT ]\\\ + nNum_Employees [ INT ]\\n\\nhead :\\nhead_ID [ INT ] primary_key\\nname [ TEXT\ + \ ]\\nborn_state [ TEXT ]\\nage [ INT ]\\n\\nmanagement :\\ndepartment_ID [ INT\ + \ ] primary_key management.department_ID = department.Department_ID\\nhead_ID\ + \ [ INT ] management.head_ID = head.head_ID\\ntemporary_acting [ TEXT ]\\n\\nPlease\ + \ write me a SQL statement that answers the following question: List the name,\ + \ born state and age of the heads of departments ordered by age. [/INST]\",\n\ + \ \"output\": \"SELECT name , born_state , age FROM head ORDER BY age;\"\ + \n },\n \"truncated_cells\": []\n }\n]" +- source_sentence: 'USER_QUERY: multimodal conversation dataset' + sentences: + - "HUB_DATASET_PREVIEW: DATASET_NAME: \"BUAADreamer/llava-en-zh-2k\"\nFEATURES:\ + \ {'messages': [{'role': {'dtype': 'string', '_type': 'Value'}, 'content': {'dtype':\ + \ 'string', '_type': 'Value'}}], 'images': {'feature': {'_type': 'Image'}, '_type':\ + \ 'Sequence'}}\nDATA SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\": {\n \ + \ \"messages\": [\n {\n \"role\": \"user\",\n \"content\"\ + : \"How many baseball players are visible in the image?\"\n },\n\ + \ {\n \"role\": \"assistant\",\n \"content\": \"There\ + \ are three baseball players visible in the image.\"\n },\n {\n\ + \ \"role\": \"user\",\n \"content\": \"What are the players\ + \ holding in their hands?\"\n },\n {\n \"role\": \"assistant\"\ + ,\n \"content\": \"The players are holding baseball bats in their hands.\"\ + \n },\n {\n \"role\": \"user\",\n \"content\"\ + : \"Are the players in a dugout?\"\n },\n {\n \"role\"\ + : \"assistant\",\n \"content\": \"Yes, the three baseball players are\ + \ standing in the dugout.\"\n },\n {\n \"role\": \"user\"\ + ,\n \"content\": \"Is the image in color or black and white?\"\n \ + \ },\n {\n \"role\": \"assistant\",\n \"content\"\ + : \"The image is an old black and white photo of the three baseball players.\"\ + \n },\n {\n \"role\": \"user\",\n \"content\"\ + : \"Do the players belong to a specific baseball team?\"\n },\n \ + \ {\n \"role\": \"assistant\",\n \"content\": \"Yes, the players\ + \ belong to the Boston Red Sox baseball team.\"\n }\n ],\n \"\ + images\": [\n {\n \"src\": \"https://datasets-server.huggingface.co/assets/BUAADreamer/llava-en-zh-2k/--/fba994c834822bddd3cd79e929c33135f4289d2b/--/en/train/0/images/image-1d100e9.jpg?Expires=1726591851&Signature=QHGD147HyWamORfSjz0QoG51Ru86g3STPBNDAEOLK7NTq8Y~b4vVt3u~XF9njlRWwNnVF7AQ8-l9f2pCWxggPnZw1wZEfBAC5Q1oOW2CwT-gMYME~I-9qeJrbQtkszer9U0-H5rkECK0DVgWKKIagyjjJBmSSLz2QAb3BlV5aEMGEbJj-4G1X2xzoVaMIfg~ogGSb6~2PAKoLoKMPUZIxrm~2LoPEGVJeXr7JIa8-ISVukshW8ZEDQII~eXfOBeEsTd1Mk5BpJz26dH3W7X2rdhq54KeQ8H9OBt~HlU0XbXigFABeMymb-GBH-fDNHUDmb9Zt3io5hBMOd7I64DLyQ__&Key-Pair-Id=K3EI6M078Z3AC3\"\ + ,\n \"height\": 308,\n \"width\": 384\n }\n ]\n\ + \ },\n \"truncated_cells\": []\n },\n {\n \"row_idx\": 1,\n \"row\"\ + : {\n \"messages\": [\n {\n \"role\": \"user\",\n \ + \ \"content\": \"What is the man wearing in the image?\"\n },\n\ + \ {\n \"role\": \"assistant\",\n \"content\": \"The man\ + \ in the image is wearing chaps, a cowboy hat, and is dressed like a cowboy.\"\ + \n },\n {\n \"role\": \"user\",\n \"content\"\ + : \"What is the cowboy doing in the image?\"\n },\n {\n \ + \ \"role\": \"assistant\",\n \"content\": \"The cowboy is riding on\ + \ a horse in a grassy field and pointing into the distance, likely directing attention\ + \ towards a specific location or object.\"\n },\n {\n \"\ + role\": \"user\",\n \"content\": \"What kind of landscape surrounds the\ + \ cowboy and his horse?\"\n },\n {\n \"role\": \"assistant\"\ + ,\n \"content\": \"The cowboy and his horse are surrounded by a grassy\ + \ field, which creates a natural and open setting for the scene.\"\n },\n\ + \ {\n \"role\": \"user\",\n \"content\": \"What is the\ + \ significance of the cowboy attire and its relation to horse riding?\"\n \ + \ },\n {\n \"role\": \"assistant\",\n \"content\"\ + : \"Cowboy attire is deeply rooted in the history, culture, and traditions of\ + \ the American West. It is both functional and symbolic, representing the hardworking\ + \ and rugged lifestyle of cowboys who spent much of their time working with livestock\ + \ and riding horses.\\n\\nThe key elements of cowboy attire, like the cowboy hat\ + \ and chaps, serve specific purposes:\\n1. Cowboy Hat: A wide-brimmed hat made\ + \ of stiffened felt or straw is designed to protect the cowboy from the sun, wind,\ + \ and rain. The hat's distinctive high crown allows for air circulation and provides\ + \ a touch of style.\\n2. Chaps: These are sturdy leather leg coverings worn over\ + \ pants to protect the cowboy's legs from rough terrain, thorny bushes, and other\ + \ elements while riding or working with livestock. Chaps provide an additional\ + \ layer of protection and durability, particularly while riding through rugged\ + \ landscapes and herding cattle.\\n\\nIn summary, the cowboy attire seen in the\ + \ image is not just a fashion statement but is deeply connected to the history\ + \ and traditions of the American West. The functional pieces of clothing, like\ + \ the cowboy hat and chaps, are designed to protect and support the cowboy during\ + \ horse riding and working with livestock.\"\n }\n ],\n \"images\"\ + : [\n {\n \"src\": \"https://datasets-server.huggingface.co/assets/BUAADreamer/llava-en-zh-2k/--/fba994c834822bddd3cd79e929c33135f4289d2b/--/en/train/1/images/image-1d100e9.jpg?Expires=1726591851&Signature=WyNDGZXVbzPOU9iOQSDPFt1MizgmdT-KqdVAG8nIVSK0Gg8OO-qmhKxgIVjyWMHnWyNbW5svuMoukPMyv9hiHMsNh0YmzdjMR9Gwb6mRvsisEAdaLl71Q053MYxEqkZWCB6PbXG5yEazHL4RHvDphsUEhZS-0Yk8Kzx0HHc12HNaJfiO4fO4IPkY3eLw5xLgNoKIcvvO9TDo0JEbc1ej6YkxGUdqXyVrG2Y4zYnhrCM0drgKVzq24cQ9YZ78HW5f-EsXsftbj0ZzEg4SKcuVgrqaKG8SJ~i0aV-OtkXiTCWxW16D4hfsmpXZShZAHesa1EOGprkYdtQG4Kfte12maQ__&Key-Pair-Id=K3EI6M078Z3AC3\"\ + ,\n \"height\": 288,\n \"width\": 384\n }\n ]\n\ + \ },\n \"truncated_cells\": []\n }\n]" + - "HUB_DATASET_PREVIEW: DATASET_NAME: \"suolyer/eprstmt\"\nFEATURES: {'input': {'dtype':\ + \ 'string', '_type': 'Value'}, 'output': {'dtype': 'string', '_type': 'Value'},\ + \ 'choice': {'feature': {'dtype': 'string', '_type': 'Value'}, '_type': 'Sequence'},\ + \ 'label': {'dtype': 'int64', '_type': 'Value'}, 'id': {'dtype': 'int64', '_type':\ + \ 'Value'}}\nDATA SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\": {\n \ + \ \"input\": \"\\u7ed9\\u51fa\\u5546\\u54c1\\u7684\\u8bc4\\u8bba\\u6587\\u672c\\\ + u53ca\\u5176\\u6781\\u6027\\uff08\\u6b63\\u9762\\u6216\\u8d1f\\u9762\\uff09\\\ + u3002\\u5982\\u679c\\u7ed9\\u5b9a\\u7684\\u53e5\\u5b50\\u53ca\\u5176\\u6781\\\ + u6027\\u5339\\u914d\\uff0c\\u5219\\u751f\\u6210\\u7b54\\u6848\\u201c\\u6b63\\\ + u9762\\u201d\\uff0c\\u5426\\u5219\\u751f\\u6210\\u7b54\\u6848\\u201c\\u8d1f\\\ + u9762\\u201d\\u3002\\u5475\\u5475\\u4e86 \\u8fd9\\u7269\\u6d41\\u901f\\u5ea6\\\ + u4e5f\\u662f\\u6ca1\\u8c01\\u4e86 \\u540c\\u57ce\\u7f51\\u8d2d\\u7adf\\u7136\\\ + u4e09\\u5929\\u4e86\\u8fd8\\u4e0d\\u5230\",\n \"output\": \"\\u8d1f\\u9762\"\ + ,\n \"choice\": [\n \"\\u8d1f\\u9762\",\n \"\\u6b63\\u9762\"\ + \n ],\n \"label\": 0,\n \"id\": 0\n },\n \"truncated_cells\"\ + : []\n },\n {\n \"row_idx\": 1,\n \"row\": {\n \"input\": \"\\u7ed9\\\ + u51fa\\u5546\\u54c1\\u7684\\u8bc4\\u8bba\\u6587\\u672c\\u53ca\\u5176\\u6781\\\ + u6027\\uff08\\u6b63\\u9762\\u6216\\u8d1f\\u9762\\uff09\\u3002\\u5982\\u679c\\\ + u7ed9\\u5b9a\\u7684\\u53e5\\u5b50\\u53ca\\u5176\\u6781\\u6027\\u5339\\u914d\\\ + uff0c\\u5219\\u751f\\u6210\\u7b54\\u6848\\u201c\\u6b63\\u9762\\u201d\\uff0c\\\ + u5426\\u5219\\u751f\\u6210\\u7b54\\u6848\\u201c\\u8d1f\\u9762\\u201d\\u3002\\\ + u8fd8\\u4e0d\\u9519\\uff0c\\u7b49\\u8bd5\\u7528\\u4e00\\u6bb5\\u65f6\\u95f4\\\ + u518d\\u8bf4\",\n \"output\": \"\\u6b63\\u9762\",\n \"choice\": [\n\ + \ \"\\u8d1f\\u9762\",\n \"\\u6b63\\u9762\"\n ],\n \"label\"\ + : 1,\n \"id\": 0\n },\n \"truncated_cells\": []\n }\n]" + - "NEGATIVE: DATASET_NAME: \"passing2961/photochat_plus\"\nFEATURES: {'photo_description':\ + \ {'dtype': 'string', '_type': 'Value'}, 'trigger_sentences': {'feature': {'dtype':\ + \ 'string', '_type': 'Value'}, '_type': 'Sequence'}, 'dialogue_id': {'dtype':\ + \ 'int64', '_type': 'Value'}, 'photo_url': {'dtype': 'string', '_type': 'Value'},\ + \ 'dialogue': [{'message': {'dtype': 'string', '_type': 'Value'}, 'share_photo':\ + \ {'dtype': 'bool', '_type': 'Value'}, 'user_id': {'dtype': 'int64', '_type':\ + \ 'Value'}}], 'image_descriptions': {'feature': {'dtype': 'string', '_type': 'Value'},\ + \ '_type': 'Sequence'}, 'intents': {'feature': {'dtype': 'string', '_type': 'Value'},\ + \ '_type': 'Sequence'}, 'salient_information': {'feature': {'dtype': 'string',\ + \ '_type': 'Value'}, '_type': 'Sequence'}, 'photo_id': {'dtype': 'string', '_type':\ + \ 'Value'}}\nDATA SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\": {\n \ + \ \"photo_description\": \"The photo has your brother Kannon. Objects in the photo:\ + \ Man\",\n \"trigger_sentences\": [\n \"How is Kannon doing?\"\n \ + \ ],\n \"dialogue_id\": 500,\n \"photo_url\": \"https://farm6.staticflickr.com/151/369716968_bde7e83418_o.jpg\"\ + ,\n \"dialogue\": [\n {\n \"message\": \"Hello, how have\ + \ you been, dear friend?\",\n \"share_photo\": false,\n \"user_id\"\ + : 1\n },\n {\n \"message\": \"Great!\",\n \"share_photo\"\ + : false,\n \"user_id\": 0\n },\n {\n \"message\"\ + : \"Thanks for asking\",\n \"share_photo\": false,\n \"user_id\"\ + : 0\n },\n {\n \"message\": \"And how have you been?\"\ + ,\n \"share_photo\": false,\n \"user_id\": 0\n },\n \ + \ {\n \"message\": \"It seems like we haven't talked in forever\"\ + ,\n \"share_photo\": false,\n \"user_id\": 0\n },\n \ + \ {\n \"message\": \"I have been doing well, keeping busy, spent\ + \ a lot of time outdoors. What have you been up to?\",\n \"share_photo\"\ + : false,\n \"user_id\": 1\n },\n {\n \"message\"\ + : \"Last night my brother Kannon did a poetry reading\",\n \"share_photo\"\ + : false,\n \"user_id\": 0\n },\n {\n \"message\"\ + : \"Really? How did it go? You know how much I love poetry.\",\n \"share_photo\"\ + : false,\n \"user_id\": 1\n },\n {\n \"message\"\ + : \"It went really well\",\n \"share_photo\": false,\n \"user_id\"\ + : 0\n },\n {\n \"message\": \"Do you remember my brother\ + \ Kannon?\",\n \"share_photo\": false,\n \"user_id\": 0\n \ + \ },\n {\n \"message\": \"Absolutely! How could I forget,\ + \ he left quite an impression\",\n \"share_photo\": false,\n \ + \ \"user_id\": 1\n },\n {\n \"message\": \"How is Kannon\ + \ doing?\",\n \"share_photo\": false,\n \"user_id\": 1\n \ + \ },\n {\n \"message\": \"\",\n \"share_photo\":\ + \ true,\n \"user_id\": 0\n },\n {\n \"message\"\ + : \"Great\",\n \"share_photo\": false,\n \"user_id\": 0\n \ + \ },\n {\n \"message\": \"Here is a photo from last night\"\ + ,\n \"share_photo\": false,\n \"user_id\": 0\n },\n \ + \ {\n \"message\": \"Wow, he seems so confident in that pic! Wish\ + \ that I could have been there.\",\n \"share_photo\": false,\n \ + \ \"user_id\": 1\n }\n ],\n \"image_descriptions\": [\n \ + \ \"A photo of Kannon\",\n \"A picture of Kannon.\",\n \"a\ + \ photo of recent situation\"\n ],\n \"intents\": [\n \"Information\ + \ Dissemination\",\n \"Social Bonding\"\n ],\n \"salient_information\"\ + : [\n \"poetry\",\n \"How is Kannon doing?\",\n \"Kannon\ + \ doing\"\n ],\n \"photo_id\": \"train/19e8f436d4b2fc25\"\n },\n\ + \ \"truncated_cells\": []\n },\n {\n \"row_idx\": 1,\n \"row\": {\n\ + \ \"photo_description\": \"The photo has your uncle Kieran. Objects in the\ + \ photo: Clothing, Man\",\n \"trigger_sentences\": [\n \"guess what\ + \ new animal he got?\",\n \"He's always had goats and chickens, but guess\ + \ what new animal he got?\"\n ],\n \"dialogue_id\": 501,\n \"photo_url\"\ + : \"https://farm8.staticflickr.com/53/189664134_f70fc8947a_o.jpg\",\n \"\ + dialogue\": [\n {\n \"message\": \"Hey! You remember my uncle\ + \ who owns the hobby farm, right?\",\n \"share_photo\": false,\n \ + \ \"user_id\": 0\n },\n {\n \"message\": \"Yeah i\ + \ do\",\n \"share_photo\": false,\n \"user_id\": 1\n \ + \ },\n {\n \"message\": \"Uncle Keiran?\",\n \"share_photo\"\ + : false,\n \"user_id\": 0\n },\n {\n \"message\"\ + : \"How about him?\",\n \"share_photo\": false,\n \"user_id\"\ + : 1\n },\n {\n \"message\": \"He's always had goats and\ + \ chickens, but guess what new animal he got?\",\n \"share_photo\": false,\n\ + \ \"user_id\": 0\n },\n {\n \"message\": \"Dog?\"\ + ,\n \"share_photo\": false,\n \"user_id\": 1\n },\n \ + \ {\n \"message\": \"Nope, a wild hog!\",\n \"share_photo\"\ + : false,\n \"user_id\": 0\n },\n {\n \"message\"\ + : \"And not the motorcycle kind ;)\",\n \"share_photo\": false,\n \ + \ \"user_id\": 0\n },\n {\n \"message\": \"\",\n\ + \ \"share_photo\": true,\n \"user_id\": 0\n },\n \ + \ {\n \"message\": \"Wow\",\n \"share_photo\": false,\n \ + \ \"user_id\": 1\n }\n ],\n \"image_descriptions\": [\n\ + \ \"A photo of the hog's appearance.\",\n \"a photo of wild hog\"\ + ,\n \"An image of the new wild hog\"\n ],\n \"intents\": [\n\ + \ \"Social Bonding\",\n \"Visual Clarification\"\n ],\n \ + \ \"salient_information\": [\n \"hog\",\n \"not the motorcycle\ + \ kind\",\n \"wild hog\",\n \"a wild hog\"\n ],\n \"photo_id\"\ + : \"train/07d688f5e2142b87\"\n },\n \"truncated_cells\": []\n }\n]" +- source_sentence: 'USER_QUERY: kotlin code dataset' + sentences: + - "HUB_DATASET_PREVIEW: DATASET_NAME: \"DucHaiten/Classic-Anime\"\nFEATURES: {'image':\ + \ {'_type': 'Image'}}\nDATA SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\"\ + : {\n \"image\": {\n \"src\": \"https://datasets-server.huggingface.co/assets/DucHaiten/Classic-Anime/--/8b5b48b361fc115087d3e909f5756f83691dd215/--/default/train/0/image/image.jpg?Expires=1726591575&Signature=s8HUsrjKzPR82e4Z2ivQvcFiaQPhhRtKOhAeOAQv2J667GZW65fWMTXre6-aFpEQUB4m01SIA~Dqn~pDM07eXZhMTWg53y-bg-2ZzqdROTWriUSHNMCF~O1LO9PLJ29Hv6NrHuiCWYZGiB62Xz3442Xp4JbkdoyWH~GjuJuxfF~knZ7TiUvcxv5eBqXFTHYkl4x1isTsv25xhRIfOac0u0zsVG8lO228oYDeSYVqkWyZobB6udMtYo8K4YebHWWaNPKrblmoTW3fbBzllbwxHoH2afSEui~Gy0CHeAerrnlAH7c9f4bG5e~qGx6IgNQSH-hZHXFaEmmIkcLNPd8NCA__&Key-Pair-Id=K3EI6M078Z3AC3\"\ + ,\n \"height\": 1080,\n \"width\": 1920\n }\n },\n \"\ + truncated_cells\": []\n },\n {\n \"row_idx\": 1,\n \"row\": {\n \"\ + image\": {\n \"src\": \"https://datasets-server.huggingface.co/assets/DucHaiten/Classic-Anime/--/8b5b48b361fc115087d3e909f5756f83691dd215/--/default/train/1/image/image.jpg?Expires=1726591575&Signature=I~oedYR11PUVCBujVEn--etHf8sNa8JSR0GaRvZx5qDXXSKIpcPOb3haWO3vtiIVuE-FOxMl-9G4HIQ4v4EbvQDUBimqZytYMD5h86vGxLJYcp9BOeeK6gVjw0b6YGA5z6UmzuJ6Zq4K5GYNjG6C9PjFnr0nFDPAys69Um4z~toHQiPM37S3ilBO9UOk1eKmRge75~-ZEkfOPAsk7PG1Eny2qoLaz7ADmjF-Sm-fXqcBhjLpzhHvMqfq~4Grvq7SY2CUVM-amU0a5Jz6Hul62WhPbtYm8rLqkSVFsj8FK5Mk1UG2PscSUjoMEPVPL6d8T9htkeC8Yj1axBnkHKJXww__&Key-Pair-Id=K3EI6M078Z3AC3\"\ + ,\n \"height\": 1080,\n \"width\": 1440\n }\n },\n \"\ + truncated_cells\": []\n }\n]" + - "NEGATIVE: DATASET_NAME: \"vikp/starcoder_cleaned\"\nFEATURES: {'code': {'dtype':\ + \ 'string', '_type': 'Value'}, 'repo_path': {'dtype': 'string', '_type': 'Value'}}\n\ + DATA SAMPLE:\n[\n {\n \"row_idx\": 0,\n \"row\": {\n \"code\": \"\ + # ---\\n# jupyter:\\n# jupytext:\\n# text_representation:\\n# extension:\ + \ .py\\n# format_name: light\\n# format_version: '1.5'\\n# jupytext_version:\ + \ 1.14.4\\n# kernelspec:\\n# display_name: Python 3\\n# language: python\\\ + n# name: python3\\n# ---\\n\\n# # 09 Strain Gage\\n#\\n# This is one of the\ + \ most commonly used sensor. It is used in many transducers. Its fundamental\ + \ operating principle is fairly easy to understand and it will be the purpose\ + \ of this lecture. \\n#\\n# A strain gage is essentially a thin wire that is wrapped\ + \ on film of plastic. \\n# \\n# The strain gage is then mounted (glued) on the part for which the strain\ + \ must be measured. \\n# \\n#\\n# ## Stress, Strain\\n# When a beam is under axial load, the axial stress,\ + \ $\\\\sigma_a$, is defined as:\\n# \\\\begin{align*}\\n# \\\\sigma_a = \\\\frac{F}{A}\\\ + n# \\\\end{align*}\\n# with $F$ the axial load, and $A$ the cross sectional area\ + \ of the beam under axial load.\\n#\\n# \\n#\\n# Under the load, the beam of length $L$ will extend\ + \ by $dL$, giving rise to the definition of strain, $\\\\epsilon_a$:\\n# \\\\\ + begin{align*}\\n# \\\\epsilon_a = \\\\frac{dL}{L}\\n# \\\\end{align*}\\n# The\ + \ beam will also contract laterally: the cross sectional area is reduced by $dA$.\ + \ This results in a transverval strain $\\\\epsilon_t$. The transversal and\ + \ axial strains are related by the Poisson's ratio:\\n# \\\\begin{align*}\\n#\ + \ \\\\nu = - \\\\frac{\\\\epsilon_t }{\\\\epsilon_a}\\n# \\\\end{align*}\\n# For\ + \ a metal the Poission's ratio is typically $\\\\nu = 0.3$, for an incompressible\ + \ material, such as rubber (or water), $\\\\nu = 0.5$.\\n#\\n# Within the elastic\ + \ limit, the axial stress and axial strain are related through Hooke's law by\ + \ the Young's modulus, $E$:\\n# \\\\begin{align*}\\n# \\\\sigma_a = E \\\\epsilon_a\\\ + n# \\\\end{align*}\\n#\\n# \\n\\n# ## Resistance of a wire\\n#\\n# The electrical resistance of a wire\ + \ $R$ is related to its physical properties (the electrical resistiviy, $\\\\\ + rho$ in $\\\\Omega$/m) and its geometry: length $L$ and cross sectional area $A$.\\\ + n#\\n# \\\\begin{align*}\\n# R = \\\\frac{\\\\rho L}{A}\\n# \\\\end{align*}\\\ + n#\\n# Mathematically, the change in wire dimension will result inchange in its\ + \ electrical resistance. This can be derived from first principle:\\n# \\\\begin{align}\\\ + n# \\\\frac{dR}{R} = \\\\frac{d\\\\rho}{\\\\rho} + \\\\frac{dL}{L} - \\\\frac{dA}{A}\\\ + n# \\\\end{align}\\n# If the wire has a square cross section, then:\\n# \\\\begin{align*}\\\ + n# A & = L'^2 \\\\\\\\\\n# \\\\frac{dA}{A} & = \\\\frac{d(L'^2)}{L'^2} = \\\\\ + frac{2L'dL'}{L'^2} = 2 \\\\frac{dL'}{L'}\\n# \\\\end{align*}\\n# We have related\ + \ the change in cross sectional area to the transversal strain.\\n# \\\\begin{align*}\\\ + n# \\\\epsilon_t = \\\\frac{dL'}{L'}\\n# \\\\end{align*}\\n# Using the Poisson's\ + \ ratio, we can relate then relate the change in cross-sectional area ($dA/A$)\ + \ to axial strain $\\\\epsilon_a = dL/L$.\\n# \\\\begin{align*}\\n# \\\\epsilon_t\ + \ &= - \\\\nu \\\\epsilon_a \\\\\\\\\\n# \\\\frac{dL'}{L'} &= - \\\\nu \\\\frac{dL}{L}\ + \ \\\\; \\\\text{or}\\\\\\\\\\n# \\\\frac{dA}{A} & = 2\\\\frac{dL'}{L'} = -2 \\\ + \\nu \\\\frac{dL}{L}\\n# \\\\end{align*}\\n# Finally we can substitute express\ + \ $dA/A$ in eq. for $dR/R$ and relate change in resistance to change of wire geometry,\ + \ remembering that for a metal $\\\\nu =0.3$:\\n# \\\\begin{align}\\n# \\\\frac{dR}{R}\ + \ & = \\\\frac{d\\\\rho}{\\\\rho} + \\\\frac{dL}{L} - \\\\frac{dA}{A} \\\\\\\\\ + \\n# & = \\\\frac{d\\\\rho}{\\\\rho} + \\\\frac{dL}{L} - (-2\\\\nu \\\\frac{dL}{L})\ + \ \\\\\\\\\\n# & = \\\\frac{d\\\\rho}{\\\\rho} + 1.6 \\\\frac{dL}{L} = \\\\frac{d\\\ + \\rho}{\\\\rho} + 1.6 \\\\epsilon_a\\n# \\\\end{align}\\n# It also happens that\ + \ for most metals, the resistivity increases with axial strain. In general, one\ + \ can then related the change in resistance to axial strain by defining the strain\ + \ gage factor:\\n# \\\\begin{align}\\n# S = 1.6 + \\\\frac{d\\\\rho}{\\\\rho}\\\ + \\cdot \\\\frac{1}{\\\\epsilon_a}\\n# \\\\end{align}\\n# and finally, we have:\\\ + n# \\\\begin{align*}\\n# \\\\frac{dR}{R} = S \\\\epsilon_a\\n# \\\\end{align*}\\\ + n# $S$ is materials dependent and is typically equal to 2.0 for most commercially\ + \ availabe strain gages. It is dimensionless.\\n#\\n# Strain gages are made of\ + \ thin wire that is wraped in several loops, effectively increasing the length\ + \ of the wire and therefore the sensitivity of the sensor.\\n#\\n# _Question:\\\ + n#\\n# Explain why a longer wire is necessary to increase the sensitivity of the\ + \ sensor_.\\n#\\n# Most commercially available strain gages have a nominal resistance\ + \ (resistance under no load, $R_{ini}$) of 120 or 350 $\\\\Omega$.\\n#\\n# Within\ + \ the elastic regime, strain is typically within the range $10^{-6} - 10^{-3}$,\ + \ in fact strain is expressed in unit of microstrain, with a 1 microstrain = $10^{-6}$.\ + \ Therefore, changes in resistances will be of the same order. If one were to\ + \ measure resistances, we will need a dynamic range of 120 dB, whih is typically\ + \ very expensive. Instead, one uses the Wheatstone bridge to transform the change\ + \ in resistance to a voltage, which is easier to measure and does not require\ + \ such a large dynamic range.\\n\\n# ## Wheatstone bridge:\\n# \\n#\\n# The output voltage is related to the difference\ + \ in resistances in the bridge:\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s} =\ + \ \\\\frac{R_1R_3-R_2R_4}{(R_1+R_4)(R_2+R_3)}\\n# \\\\end{align*}\\n#\\n# If the\ + \ bridge is balanced, then $V_o = 0$, it implies: $R_1/R_2 = R_4/R_3$.\\n#\\n#\ + \ In practice, finding a set of resistors that balances the bridge is challenging,\ + \ and a potentiometer is used as one of the resistances to do minor adjustement\ + \ to balance the bridge. If one did not do the adjustement (ie if we did not\ + \ zero the bridge) then all the measurement will have an offset or bias that could\ + \ be removed in a post-processing phase, as long as the bias stayed constant.\\\ + n#\\n# If each resistance $R_i$ is made to vary slightly around its initial value,\ + \ ie $R_i = R_{i,ini} + dR_i$. For simplicity, we will assume that the initial\ + \ value of the four resistances are equal, ie $R_{1,ini} = R_{2,ini} = R_{3,ini}\ + \ = R_{4,ini} = R_{ini}$. This implies that the bridge was initially balanced,\ + \ then the output voltage would be:\\n#\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s}\ + \ = \\\\frac{1}{4} \\\\left( \\\\frac{dR_1}{R_{ini}} - \\\\frac{dR_2}{R_{ini}}\ + \ + \\\\frac{dR_3}{R_{ini}} - \\\\frac{dR_4}{R_{ini}} \\\\right)\\n# \\\\end{align*}\\\ + n#\\n# Note here that the changes in $R_1$ and $R_3$ have a positive effect on\ + \ $V_o$, while the changes in $R_2$ and $R_4$ have a negative effect on $V_o$.\ + \ In practice, this means that is a beam is a in tension, then a strain gage\ + \ mounted on the branch 1 or 3 of the Wheatstone bridge will produce a positive\ + \ voltage, while a strain gage mounted on branch 2 or 4 will produce a negative\ + \ voltage. One takes advantage of this to increase sensitivity to measure strain.\\\ + n#\\n# ### Quarter bridge\\n# One uses only one quarter of the bridge, ie strain\ + \ gages are only mounted on one branch of the bridge.\\n#\\n# \\\\begin{align*}\\\ + n# \\\\frac{V_o}{V_s} = \\\\pm \\\\frac{1}{4} \\\\epsilon_a S\\n# \\\\end{align*}\\\ + n# Sensitivity, $G$:\\n# \\\\begin{align*}\\n# G = \\\\frac{V_o}{\\\\epsilon_a}\ + \ = \\\\pm \\\\frac{1}{4}S V_s\\n# \\\\end{align*}\\n#\\n#\\n# ### Half bridge\\\ + n# One uses half of the bridge, ie strain gages are mounted on two branches of\ + \ the bridge.\\n#\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s} = \\\\pm \\\\frac{1}{2}\ + \ \\\\epsilon_a S\\n# \\\\end{align*}\\n#\\n# ### Full bridge\\n#\\n# One uses\ + \ of the branches of the bridge, ie strain gages are mounted on each branch.\\\ + n#\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s} = \\\\pm \\\\epsilon_a S\\n# \\\ + \\end{align*}\\n#\\n# Therefore, as we increase the order of bridge, the sensitivity\ + \ of the instrument increases. However, one should be carefull how we mount the\ + \ strain gages as to not cancel out their measurement.\\n\\n# _Exercise_\\n#\\\ + n# 1- Wheatstone bridge\\n#\\n# \\n#\\n# > How important is it to know \\\\& match the resistances of\ + \ the resistors you employ to create your bridge?\\n# > How would you do that\ + \ practically?\\n# > Assume $R_1=120\\\\,\\\\Omega$, $R_2=120\\\\,\\\\Omega$,\ + \ $R_3=120\\\\,\\\\Omega$, $R_4=110\\\\,\\\\Omega$, $V_s=5.00\\\\,\\\\text{V}$.\ + \ What is $V_\\\\circ$?\\n\\nVs = 5.00\\nVo = (120**2-120*110)/(230*240) * Vs\\\ + nprint('Vo = ',Vo, ' V')\\n\\n# typical range in strain a strain gauge can measure\\\ + n# 1 -1000 micro-Strain\\nAxialStrain = 1000*10**(-6) # axial strain\\nStrainGageFactor\ + \ = 2\\nR_ini = 120 # Ohm\\nR_1 = R_ini+R_ini*StrainGageFactor*AxialStrain\\nprint(R_1)\\\ + nVo = (120**2-120*(R_1))/((120+R_1)*240) * Vs\\nprint('Vo = ', Vo, ' V')\\n\\\ + n# > How important is it to know \\\\& match the resistances of the resistors\ + \ you employ to create your bridge?\\n# > How would you do that practically?\\\ + n# > Assume $R_1= R_2 =R_3=120\\\\,\\\\Omega$, $R_4=120.01\\\\,\\\\Omega$, $V_s=5.00\\\ + \\,\\\\text{V}$. What is $V_\\\\circ$?\\n\\nVs = 5.00\\nVo = (120**2-120*120.01)/(240.01*240)\ + \ * Vs\\nprint(Vo)\\n\\n# 2- Strain gage 1:\\n#\\n# One measures the strain on\ + \ a bridge steel beam. The modulus of elasticity is $E=190$ GPa. Only one strain\ + \ gage is mounted on the bottom of the beam; the strain gage factor is $S=2.02$.\\\ + n#\\n# > a) What kind of electronic circuit will you use? Draw a sketch of it.\\\ + n#\\n# > b) Assume all your resistors including the unloaded strain gage are balanced\ + \ and measure $120\\\\,\\\\Omega$, and that the strain gage is at location $R_2$.\ + \ The supply voltage is $5.00\\\\,\\\\text{VDC}$. Will $V_\\\\circ$ be positive\ + \ or negative when a downward load is added?\\n\\n# In practice, we cannot have\ + \ all resistances = 120 $\\\\Omega$. at zero load, the bridge will be unbalanced\ + \ (show $V_o \\\\neq 0$). How could we balance our bridge?\\n#\\n# Use a potentiometer\ + \ to balance bridge, for the load cell, we ''zero'' the instrument.\\n#\\n# Other\ + \ option to zero-out our instrument? Take data at zero-load, record the voltage,\ + \ $V_{o,noload}$. Substract $V_{o,noload}$ to my data.\\n\\n# > c) For a loading\ + \ in which $V_\\\\circ = -1.25\\\\,\\\\text{mV}$, calculate the strain $\\\\epsilon_a$\ + \ in units of microstrain.\\n\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s} & =\ + \ - \\\\frac{1}{4} \\\\epsilon_a S\\\\\\\\\\n# \\\\epsilon_a & = -\\\\frac{4}{S}\ + \ \\\\frac{V_o}{V_s}\\n# \\\\end{align*}\\n\\nS = 2.02\\nVo = -0.00125\\nVs =\ + \ 5\\neps_a = -1*(4/S)*(Vo/Vs)\\nprint(eps_a)\\n\\n# > d) Calculate the axial\ + \ stress (in MPa) in the beam under this load.\\n\\n\\n\\n# > e) You now want\ + \ more sensitivity in your measurement, you install a second strain gage on to\\\ + n\\n# p of the beam. Which resistor should you use for this second active strain\ + \ gage?\\n#\\n# > f) With this new setup and the same applied load than previously,\ + \ what should be the output voltage?\\n\\n# 3- Strain Gage with Long Lead Wires\ + \ \\n#\\n# \\n#\\\ + n# A quarter bridge strain gage Wheatstone bridge circuit is constructed with\ + \ $120\\\\,\\\\Omega$ resistors and a $120\\\\,\\\\Omega$ strain gage. For this\ + \ practical application, the strain gage is located very far away form the DAQ\ + \ station and the lead wires to the strain gage are $10\\\\,\\\\text{m}$ long\ + \ and the lead wire have a resistance of $0.080\\\\,\\\\Omega/\\\\text{m}$. The\ + \ lead wire resistance can lead to problems since $R_{lead}$ changes with temperature.\\\ + n#\\n# > Design a modified circuit that will cancel out the effect of the lead\ + \ wires.\\n\\n# ## Homework\\n#\\n\",\n \"repo_path\": \"Lectures/09_StrainGage.ipynb\"\ + \n },\n \"truncated_cells\": []\n },\n {\n \"row_idx\": 1,\n \"\ + row\": {\n \"code\": \"# ---\\n# jupyter:\\n# jupytext:\\n# split_at_heading:\ + \ true\\n# text_representation:\\n# extension: .py\\n# format_name:\ + \ light\\n# format_version: '1.5'\\n# jupytext_version: 1.14.4\\n#\ + \ kernelspec:\\n# display_name: Python 3\\n# language: python\\n# \ + \ name: python3\\n# ---\\n\\n#export\\nfrom fastai.basics import *\\nfrom fastai.tabular.core\ + \ import *\\nfrom fastai.tabular.model import *\\n\\nfrom fastai.tabular.data\ + \ import *\\n\\n#hide\\nfrom nbdev.showdoc import *\\n\\n\\n# +\\n#default_exp\ + \ tabular.learner\\n# -\\n\\n# # Tabular learner\\n#\\n# > The function to immediately\ + \ get a `Learner` ready to train for tabular data\\n\\n# The main function you\ + \ probably want to use in this module is `tabular_learner`. It will automatically\ + \ create a `TabulaModel` suitable for your data and infer the irght loss function.\ + \ See the [tabular tutorial](http://docs.fast.ai/tutorial.tabular) for an example\ + \ of use in context.\\n\\n# ## Main functions\\n\\n#export\\n@log_args(but_as=Learner.__init__)\\\ + nclass TabularLearner(Learner):\\n \\\"`Learner` for tabular data\\\"\\n \ + \ def predict(self, row):\\n tst_to = self.dls.valid_ds.new(pd.DataFrame(row).T)\\\ + n tst_to.process()\\n tst_to.conts = tst_to.conts.astype(np.float32)\\\ + n dl = self.dls.valid.new(tst_to)\\n inp,preds,_,dec_preds = self.get_preds(dl=dl,\ + \ with_input=True, with_decoded=True)\\n i = getattr(self.dls, 'n_inp',\ + \ -1)\\n b = (*tuplify(inp),*tuplify(dec_preds))\\n full_dec = self.dls.decode((*tuplify(inp),*tuplify(dec_preds)))\\\ + n return full_dec,dec_preds[0],preds[0]\\n\\n\\nshow_doc(TabularLearner,\ + \ title_level=3)\\n\\n\\n# It works exactly as a normal `Learner`, the only difference\ + \ is that it implements a `predict` method specific to work on a row of data.\\\ + n\\n#export\\n@log_args(to_return=True, but_as=Learner.__init__)\\n@delegates(Learner.__init__)\\\ + ndef tabular_learner(dls, layers=None, emb_szs=None, config=None, n_out=None,\ + \ y_range=None, **kwargs):\\n \\\"Get a `Learner` using `dls`, with `metrics`,\ + \ including a `TabularModel` created using the remaining params.\\\"\\n if\ + \ config is None: config = tabular_config()\\n if layers is None: layers =\ + \ [200,100]\\n to = dls.train_ds\\n emb_szs = get_emb_sz(dls.train_ds, {}\ + \ if emb_szs is None else emb_szs)\\n if n_out is None: n_out = get_c(dls)\\\ + n assert n_out, \\\"`n_out` is not defined, and could not be infered from data,\ + \ set `dls.c` or pass `n_out`\\\"\\n if y_range is None and 'y_range' in config:\ + \ y_range = config.pop('y_range')\\n model = TabularModel(emb_szs, len(dls.cont_names),\ + \ n_out, layers, y_range=y_range, **config)\\n return TabularLearner(dls, model,\ + \ **kwargs)\\n\\n\\n# If your data was built with fastai, you probably won't need\ + \ to pass anything to `emb_szs` unless you want to change the default of the library\ + \ (produced by `get_emb_sz`), same for `n_out` which should be automatically inferred.\ + \ `layers` will default to `[200,100]` and is passed to `TabularModel` along with\ + \ the `config`.\\n#\\n# Use `tabular_config` to create a `config` and cusotmize\ + \ the model used. There is just easy access to `y_range` because this argument\ + \ is often used.\\n#\\n# All the other arguments are passed to `Learner`.\\n\\\ + npath = untar_data(URLs.ADULT_SAMPLE)\\ndf = pd.read_csv(path/'adult.csv')\\ncat_names\ + \ = ['workclass', 'education', 'marital-status', 'occupation', 'relationship',\ + \ 'race']\\ncont_names = ['age', 'fnlwgt', 'education-num']\\nprocs = [Categorify,\ + \ FillMissing, Normalize]\\ndls = TabularDataLoaders.from_df(df, path, procs=procs,\ + \ cat_names=cat_names, cont_names=cont_names, \\n \ + \ y_names=\\\"salary\\\", valid_idx=list(range(800,1000)), bs=64)\\nlearn\ + \ = tabular_learner(dls)\\n\\n#hide\\ntst = learn.predict(df.iloc[0])\\n\\n# +\\\ + n#hide\\n#test y_range is passed\\nlearn = tabular_learner(dls, y_range=(0,32))\\\ + nassert isinstance(learn.model.layers[-1], SigmoidRange)\\ntest_eq(learn.model.layers[-1].low,\ + \ 0)\\ntest_eq(learn.model.layers[-1].high, 32)\\n\\nlearn = tabular_learner(dls,\ + \ config = tabular_config(y_range=(0,32)))\\nassert isinstance(learn.model.layers[-1],\ + \ SigmoidRange)\\ntest_eq(learn.model.layers[-1].low, 0)\\ntest_eq(learn.model.layers[-1].high,\ + \ 32)\\n\\n\\n# -\\n\\n#export\\n@typedispatch\\ndef show_results(x:Tabular, y:Tabular,\ + \ samples, outs, ctxs=None, max_n=10, **kwargs):\\n df = x.all_cols[:max_n]\\\ + n for n in x.y_names: df[n+'_pred'] = y[n][:max_n].values\\n display_df(df)\\\ + n\\n\\n# ## Export -\\n\\n#hide\\nfrom nbdev.export import notebook2script\\nnotebook2script()\\\ + n\\n\\n\",\n \"repo_path\": \"nbs/43_tabular.learner.ipynb\"\n },\n \ + \ \"truncated_cells\": []\n }\n]" + - "HUB_DATASET_PREVIEW: DATASET_NAME: \"mvasiliniuc/iva-kotlin-codeint\"\nFEATURES:\ + \ {'repo_name': {'dtype': 'string', '_type': 'Value'}, 'path': {'dtype': 'string',\ + \ '_type': 'Value'}, 'copies': {'dtype': 'string', '_type': 'Value'}, 'size':\ + \ {'dtype': 'string', '_type': 'Value'}, 'content': {'dtype': 'string', '_type':\ + \ 'Value'}, 'license': {'dtype': 'string', '_type': 'Value'}}\nDATA SAMPLE:\n\ + [\n {\n \"row_idx\": 0,\n \"row\": {\n \"repo_name\": \"Cognifide/gradle-aem-plugin\"\ + ,\n \"path\": \"src/main/kotlin/com/cognifide/gradle/aem/instance/tasks/InstanceReload.kt\"\ + ,\n \"copies\": \"1\",\n \"size\": \"1052\",\n \"content\": \"\ + package com.cognifide.gradle.aem.instance.tasks\\n\\nimport com.cognifide.gradle.aem.common.instance.action.AwaitUpAction\\\ + nimport com.cognifide.gradle.aem.common.instance.action.ReloadAction\\nimport\ + \ com.cognifide.gradle.aem.common.instance.names\\nimport com.cognifide.gradle.aem.common.tasks.Instance\\\ + nimport org.gradle.api.tasks.TaskAction\\n\\nopen class InstanceReload : Instance()\ + \ {\\n\\n private var reloadOptions: ReloadAction.() -> Unit = {}\\n\\n \ + \ fun reload(options: ReloadAction.() -> Unit) {\\n this.reloadOptions\ + \ = options\\n }\\n\\n private var awaitUpOptions: AwaitUpAction.() -> Unit\ + \ = {}\\n\\n fun awaitUp(options: AwaitUpAction.() -> Unit) {\\n this.awaitUpOptions\ + \ = options\\n }\\n\\n @TaskAction\\n fun reload() {\\n instanceManager.awaitReloaded(anyInstances,\ + \ reloadOptions, awaitUpOptions)\\n common.notifier.lifecycle(\\\"Instance(s)\ + \ reloaded\\\", \\\"Which: ${anyInstances.names}\\\")\\n }\\n\\n init {\\\ + n description = \\\"Reloads all AEM instance(s).\\\"\\n }\\n\\n companion\ + \ object {\\n const val NAME = \\\"instanceReload\\\"\\n }\\n}\\n\"\ + ,\n \"license\": \"apache-2.0\"\n },\n \"truncated_cells\": []\n },\n\ + \ {\n \"row_idx\": 1,\n \"row\": {\n \"repo_name\": \"80998062/Fank\"\ + ,\n \"path\": \"presentation/src/main/java/com/sinyuk/fanfou/ui/status/StatusView.kt\"\ + ,\n \"copies\": \"1\",\n \"size\": \"8490\",\n \"content\": \"\ + /*\\n *\\n * * Apache License\\n * *\\n * * Copyright [2017] Sinyuk\\n * *\\\ + n * * Licensed under the Apache License, Version 2.0 (the \\\"License\\\");\\\ + n * * you may not use this file except in compliance with the License.\\n * \ + \ * You may obtain a copy of the License at\\n * *\\n * * http://www.apache.org/licenses/LICENSE-2.0\\\ + n * *\\n * * Unless required by applicable law or agreed to in writing, software\\\ + n * * distributed under the License is distributed on an \\\"AS IS\\\" BASIS,\\\ + n * * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.\\\ + n * * See the License for the specific language governing permissions and\\n\ + \ * * limitations under the License.\\n *\\n */\\n\\npackage com.sinyuk.fanfou.ui.status\\\ + n\\nimport android.os.Build\\nimport android.os.Bundle\\nimport android.support.v4.app.Fragment\\\ + nimport android.support.v4.app.FragmentPagerAdapter\\nimport android.text.Editable\\\ + nimport android.text.TextWatcher\\nimport android.view.View\\nimport android.view.ViewTreeObserver\\\ + nimport cn.dreamtobe.kpswitch.util.KeyboardUtil\\nimport com.linkedin.android.spyglass.suggestions.SuggestionsResult\\\ + nimport com.linkedin.android.spyglass.suggestions.interfaces.Suggestible\\nimport\ + \ com.linkedin.android.spyglass.suggestions.interfaces.SuggestionsResultListener\\\ + nimport com.linkedin.android.spyglass.suggestions.interfaces.SuggestionsVisibilityManager\\\ + nimport com.linkedin.android.spyglass.tokenization.QueryToken\\nimport com.linkedin.android.spyglass.tokenization.impl.WordTokenizer\\\ + nimport com.linkedin.android.spyglass.tokenization.impl.WordTokenizerConfig\\\ + nimport com.linkedin.android.spyglass.tokenization.interfaces.QueryTokenReceiver\\\ + nimport com.sinyuk.fanfou.R\\nimport com.sinyuk.fanfou.base.AbstractActivity\\\ + nimport com.sinyuk.fanfou.base.AbstractFragment\\nimport com.sinyuk.fanfou.di.Injectable\\\ + nimport com.sinyuk.fanfou.domain.DO.Player\\nimport com.sinyuk.fanfou.domain.DO.Status\\\ + nimport com.sinyuk.fanfou.domain.STATUS_LIMIT\\nimport com.sinyuk.fanfou.domain.StatusCreation\\\ + nimport com.sinyuk.fanfou.domain.TIMELINE_CONTEXT\\nimport com.sinyuk.fanfou.ui.editor.EditorView\\\ + nimport com.sinyuk.fanfou.ui.editor.MentionListView\\nimport com.sinyuk.fanfou.ui.timeline.TimelineView\\\ + nimport com.sinyuk.fanfou.util.obtainViewModelFromActivity\\nimport com.sinyuk.fanfou.viewmodel.FanfouViewModelFactory\\\ + nimport com.sinyuk.fanfou.viewmodel.PlayerViewModel\\nimport kotlinx.android.synthetic.main.status_view.*\\\ + nimport kotlinx.android.synthetic.main.status_view_footer.*\\nimport kotlinx.android.synthetic.main.status_view_reply_actionbar.*\\\ + nimport javax.inject.Inject\\n\\n\\n/**\\n * Created by sinyuk on 2018/1/12.\\\ + n *\\n */\\nclass StatusView : AbstractFragment(), Injectable, QueryTokenReceiver,\ + \ SuggestionsResultListener, SuggestionsVisibilityManager {\\n\\n companion\ + \ object {\\n fun newInstance(status: Status, photoExtra: Bundle? = null)\ + \ = StatusView().apply {\\n arguments = Bundle().apply {\\n \ + \ putParcelable(\\\"status\\\", status)\\n putBundle(\\\ + \"photoExtra\\\", photoExtra)\\n }\\n }\\n }\\n\\n override\ + \ fun layoutId() = R.layout.status_view\\n\\n @Inject\\n lateinit var factory:\ + \ FanfouViewModelFactory\\n\\n private val playerViewModel by lazy { obtainViewModelFromActivity(factory,\ + \ PlayerViewModel::class.java) }\\n\\n override fun onEnterAnimationEnd(savedInstanceState:\ + \ Bundle?) {\\n super.onEnterAnimationEnd(savedInstanceState)\\n \ + \ navBack.setOnClickListener { onBackPressedSupport() }\\n setupEditor()\\\ + n setupKeyboard()\\n onTextChanged(0)\\n setupViewPager()\\\ + n\\n val status = arguments!!.getParcelable(\\\"status\\\")\\n\ + \ fullscreenButton.setOnClickListener {\\n (activity as AbstractActivity).start(EditorView.newInstance(status.id,\\\ + n replyEt.mentionsText,\\n StatusCreation.REPOST_STATUS))\\\ + n replyEt.text = null\\n }\\n }\\n\\n private fun setupViewPager()\ + \ {\\n val status = arguments!!.getParcelable(\\\"status\\\")\\\ + n val bundle = arguments!!.getBundle(\\\"photoExtra\\\")\\n val\ + \ fragments: List = if (findChildFragment(TimelineView::class.java)\ + \ == null) {\\n val mentionView = MentionListView()\\n mentionView.onItemClickListener\ + \ = onSuggestionSelectListener\\n mutableListOf(TimelineView.contextTimeline(TIMELINE_CONTEXT,\ + \ status, bundle), mentionView)\\n } else {\\n mutableListOf(findChildFragment(TimelineView::class.java),\ + \ MentionListView())\\n }\\n\\n viewPager.setPagingEnabled(false)\\\ + n viewPager.offscreenPageLimit = 1\\n viewPager.adapter = object\ + \ : FragmentPagerAdapter(childFragmentManager) {\\n override fun getItem(position:\ + \ Int) = fragments[position]\\n\\n override fun getCount() = fragments.size\\\ + n }\\n }\\n\\n private var keyboardListener: ViewTreeObserver.OnGlobalLayoutListener?\ + \ = null\\n\\n private fun setupKeyboard() {\\n keyboardListener = KeyboardUtil.attach(activity,\ + \ panelRoot, {\\n // TODO: how comes the Exception: panelRootContainer\ + \ must not be null\\n panelRootContainer?.visibility =\\n \ + \ if (it) {\\n if (replyEt.requestFocus()) replyEt.setSelection(replyEt.text.length)\\\ + n View.VISIBLE\\n } else {\\n \ + \ replyEt.clearFocus()\\n View.GONE\\\ + n }\\n })\\n }\\n\\n private val config = WordTokenizerConfig.Builder()\\\ + n .setExplicitChars(\\\"@\\\")\\n .setThreshold(3)\\n \ + \ .setMaxNumKeywords(5)\\n .setWordBreakChars(\\\" \\\").build()\\\ + n\\n private fun setupEditor() {\\n replyEt.tokenizer = WordTokenizer(config)\\\ + n replyEt.setAvoidPrefixOnTap(true)\\n replyEt.setQueryTokenReceiver(this)\\\ + n replyEt.setSuggestionsVisibilityManager(this)\\n replyEt.setAvoidPrefixOnTap(true)\\\ + n\\n replyCommitButton.setOnClickListener { }\\n\\n if (Build.VERSION.SDK_INT\ + \ >= Build.VERSION_CODES.O)\\n textCountProgress.min = 0\\n \ + \ textCountProgress.max = STATUS_LIMIT\\n replyEt.addTextChangedListener(object\ + \ : TextWatcher {\\n override fun afterTextChanged(s: Editable?) {\\\ + n onTextChanged(s?.length ?: 0)\\n }\\n\\n \ + \ override fun beforeTextChanged(s: CharSequence?, start: Int, count: Int, after:\ + \ Int) {\\n\\n }\\n\\n override fun onTextChanged(s: CharSequence?,\ + \ start: Int, before: Int, count: Int) {\\n\\n }\\n })\\n \ + \ }\\n\\n\\n /**\\n * @param count \\u5b57\\u6570\\n */\\n private\ + \ fun onTextChanged(count: Int) {\\n textCountProgress.progress = count\\\ + n replyCommitButton.isEnabled = count in 1..STATUS_LIMIT\\n }\\n\\n\\\ + n private val onSuggestionSelectListener = object : MentionListView.OnItemClickListener\ + \ {\\n override fun onItemClick(position: Int, item: Suggestible) {\\n\ + \ (item as Player).let {\\n replyEt.insertMention(it)\\\ + n displaySuggestions(false)\\n playerViewModel.updateMentionedAt(it)\ + \ //\\n onTextChanged(replyEt.text.length)\\n replyEt.requestFocus()\\\ + n replyEt.setSelection(replyEt.text.length)\\n }\\n\ + \ }\\n }\\n\\n @Suppress(\\\"PrivatePropertyName\\\")\\n private\ + \ val BUCKET = \\\"player-mentioned\\\"\\n\\n override fun onQueryReceived(queryToken:\ + \ QueryToken): MutableList {\\n val data = playerViewModel.filter(queryToken.keywords)\\\ + n onReceiveSuggestionsResult(SuggestionsResult(queryToken, data), BUCKET)\\\ + n return arrayOf(BUCKET).toMutableList()\\n }\\n\\n override fun\ + \ onReceiveSuggestionsResult(result: SuggestionsResult, bucket: String) {\\n \ + \ val data = result.suggestions\\n if (data?.isEmpty() != false)\ + \ return\\n displaySuggestions(true)\\n findChildFragment(MentionListView::class.java).setData(data)\\\ + n }\\n\\n override fun displaySuggestions(display: Boolean) {\\n \ + \ viewPager.setCurrentItem(if (display) 1 else 0, true)\\n }\\n\\n override\ + \ fun isDisplayingSuggestions() = viewPager.currentItem == 1\\n\\n override\ + \ fun onBackPressedSupport(): Boolean {\\n when {\\n panelRootContainer.visibility\ + \ == View.VISIBLE -> KeyboardUtil.hideKeyboard(panelRootContainer)\\n \ + \ isDisplayingSuggestions -> displaySuggestions(false)\\n else ->\ + \ pop()\\n }\\n return true\\n\\n }\\n\\n override fun onDestroy()\ + \ {\\n keyboardListener?.let { KeyboardUtil.detach(activity, it) }\\n \ + \ activity?.currentFocus?.let { KeyboardUtil.hideKeyboard(it) }\\n \ + \ super.onDestroy()\\n }\\n\\n}\",\n \"license\": \"mit\"\n },\n\ + \ \"truncated_cells\": []\n }\n]" +model-index: +- name: Alibaba-NLP/gte-base-en-v1.5 trained on query-to-dataset-viewer-descriptions + results: + - task: + type: triplet + name: Triplet + dataset: + name: Unknown + type: unknown + metrics: + - type: cosine_accuracy + value: 1.0 + name: Cosine Accuracy + - type: dot_accuracy + value: 0.0 + name: Dot Accuracy + - type: manhattan_accuracy + value: 1.0 + name: Manhattan Accuracy + - type: euclidean_accuracy + value: 1.0 + name: Euclidean Accuracy + - type: max_accuracy + value: 1.0 + name: Max Accuracy +--- + +# Alibaba-NLP/gte-base-en-v1.5 trained on query-to-dataset-viewer-descriptions + +This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Alibaba-NLP/gte-base-en-v1.5](https://huggingface.co/Alibaba-NLP/gte-base-en-v1.5) on the [query-to-dataset-viewer-descriptions](https://huggingface.co/datasets/davanstrien/query-to-dataset-viewer-descriptions) dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more. + +## Model Details + +### Model Description +- **Model Type:** Sentence Transformer +- **Base model:** [Alibaba-NLP/gte-base-en-v1.5](https://huggingface.co/Alibaba-NLP/gte-base-en-v1.5) +- **Maximum Sequence Length:** 8192 tokens +- **Output Dimensionality:** 768 tokens +- **Similarity Function:** Cosine Similarity +- **Training Dataset:** + - [query-to-dataset-viewer-descriptions](https://huggingface.co/datasets/davanstrien/query-to-dataset-viewer-descriptions) +- **Language:** en +- **License:** apache-2.0 + +### Model Sources + +- **Documentation:** [Sentence Transformers Documentation](https://sbert.net) +- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers) +- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers) + +### Full Model Architecture + +``` +SentenceTransformer( + (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False}) with Transformer model: NewModel + (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True}) +) +``` + +## Usage + +### Direct Usage (Sentence Transformers) + +First install the Sentence Transformers library: + +```bash +pip install -U sentence-transformers +``` + +Then you can load this model and run inference. +```python +from sentence_transformers import SentenceTransformer + +# Download from the 🤗 Hub +model = SentenceTransformer("query-to-dataset-viewer-descriptions") +# Run inference +sentences = [ + 'USER_QUERY: kotlin code dataset', + 'HUB_DATASET_PREVIEW: DATASET_NAME: "mvasiliniuc/iva-kotlin-codeint"\nFEATURES: {\'repo_name\': {\'dtype\': \'string\', \'_type\': \'Value\'}, \'path\': {\'dtype\': \'string\', \'_type\': \'Value\'}, \'copies\': {\'dtype\': \'string\', \'_type\': \'Value\'}, \'size\': {\'dtype\': \'string\', \'_type\': \'Value\'}, \'content\': {\'dtype\': \'string\', \'_type\': \'Value\'}, \'license\': {\'dtype\': \'string\', \'_type\': \'Value\'}}\nDATA SAMPLE:\n[\n {\n "row_idx": 0,\n "row": {\n "repo_name": "Cognifide/gradle-aem-plugin",\n "path": "src/main/kotlin/com/cognifide/gradle/aem/instance/tasks/InstanceReload.kt",\n "copies": "1",\n "size": "1052",\n "content": "package com.cognifide.gradle.aem.instance.tasks\\n\\nimport com.cognifide.gradle.aem.common.instance.action.AwaitUpAction\\nimport com.cognifide.gradle.aem.common.instance.action.ReloadAction\\nimport com.cognifide.gradle.aem.common.instance.names\\nimport com.cognifide.gradle.aem.common.tasks.Instance\\nimport org.gradle.api.tasks.TaskAction\\n\\nopen class InstanceReload : Instance() {\\n\\n private var reloadOptions: ReloadAction.() -> Unit = {}\\n\\n fun reload(options: ReloadAction.() -> Unit) {\\n this.reloadOptions = options\\n }\\n\\n private var awaitUpOptions: AwaitUpAction.() -> Unit = {}\\n\\n fun awaitUp(options: AwaitUpAction.() -> Unit) {\\n this.awaitUpOptions = options\\n }\\n\\n @TaskAction\\n fun reload() {\\n instanceManager.awaitReloaded(anyInstances, reloadOptions, awaitUpOptions)\\n common.notifier.lifecycle(\\"Instance(s) reloaded\\", \\"Which: ${anyInstances.names}\\")\\n }\\n\\n init {\\n description = \\"Reloads all AEM instance(s).\\"\\n }\\n\\n companion object {\\n const val NAME = \\"instanceReload\\"\\n }\\n}\\n",\n "license": "apache-2.0"\n },\n "truncated_cells": []\n },\n {\n "row_idx": 1,\n "row": {\n "repo_name": "80998062/Fank",\n "path": "presentation/src/main/java/com/sinyuk/fanfou/ui/status/StatusView.kt",\n "copies": "1",\n "size": "8490",\n "content": "/*\\n *\\n * * Apache License\\n * *\\n * * Copyright [2017] Sinyuk\\n * *\\n * * Licensed under the Apache License, Version 2.0 (the \\"License\\");\\n * * you may not use this file except in compliance with the License.\\n * * You may obtain a copy of the License at\\n * *\\n * * http://www.apache.org/licenses/LICENSE-2.0\\n * *\\n * * Unless required by applicable law or agreed to in writing, software\\n * * distributed under the License is distributed on an \\"AS IS\\" BASIS,\\n * * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.\\n * * See the License for the specific language governing permissions and\\n * * limitations under the License.\\n *\\n */\\n\\npackage com.sinyuk.fanfou.ui.status\\n\\nimport android.os.Build\\nimport android.os.Bundle\\nimport android.support.v4.app.Fragment\\nimport android.support.v4.app.FragmentPagerAdapter\\nimport android.text.Editable\\nimport android.text.TextWatcher\\nimport android.view.View\\nimport android.view.ViewTreeObserver\\nimport cn.dreamtobe.kpswitch.util.KeyboardUtil\\nimport com.linkedin.android.spyglass.suggestions.SuggestionsResult\\nimport com.linkedin.android.spyglass.suggestions.interfaces.Suggestible\\nimport com.linkedin.android.spyglass.suggestions.interfaces.SuggestionsResultListener\\nimport com.linkedin.android.spyglass.suggestions.interfaces.SuggestionsVisibilityManager\\nimport com.linkedin.android.spyglass.tokenization.QueryToken\\nimport com.linkedin.android.spyglass.tokenization.impl.WordTokenizer\\nimport com.linkedin.android.spyglass.tokenization.impl.WordTokenizerConfig\\nimport com.linkedin.android.spyglass.tokenization.interfaces.QueryTokenReceiver\\nimport com.sinyuk.fanfou.R\\nimport com.sinyuk.fanfou.base.AbstractActivity\\nimport com.sinyuk.fanfou.base.AbstractFragment\\nimport com.sinyuk.fanfou.di.Injectable\\nimport com.sinyuk.fanfou.domain.DO.Player\\nimport com.sinyuk.fanfou.domain.DO.Status\\nimport com.sinyuk.fanfou.domain.STATUS_LIMIT\\nimport com.sinyuk.fanfou.domain.StatusCreation\\nimport com.sinyuk.fanfou.domain.TIMELINE_CONTEXT\\nimport com.sinyuk.fanfou.ui.editor.EditorView\\nimport com.sinyuk.fanfou.ui.editor.MentionListView\\nimport com.sinyuk.fanfou.ui.timeline.TimelineView\\nimport com.sinyuk.fanfou.util.obtainViewModelFromActivity\\nimport com.sinyuk.fanfou.viewmodel.FanfouViewModelFactory\\nimport com.sinyuk.fanfou.viewmodel.PlayerViewModel\\nimport kotlinx.android.synthetic.main.status_view.*\\nimport kotlinx.android.synthetic.main.status_view_footer.*\\nimport kotlinx.android.synthetic.main.status_view_reply_actionbar.*\\nimport javax.inject.Inject\\n\\n\\n/**\\n * Created by sinyuk on 2018/1/12.\\n *\\n */\\nclass StatusView : AbstractFragment(), Injectable, QueryTokenReceiver, SuggestionsResultListener, SuggestionsVisibilityManager {\\n\\n companion object {\\n fun newInstance(status: Status, photoExtra: Bundle? = null) = StatusView().apply {\\n arguments = Bundle().apply {\\n putParcelable(\\"status\\", status)\\n putBundle(\\"photoExtra\\", photoExtra)\\n }\\n }\\n }\\n\\n override fun layoutId() = R.layout.status_view\\n\\n @Inject\\n lateinit var factory: FanfouViewModelFactory\\n\\n private val playerViewModel by lazy { obtainViewModelFromActivity(factory, PlayerViewModel::class.java) }\\n\\n override fun onEnterAnimationEnd(savedInstanceState: Bundle?) {\\n super.onEnterAnimationEnd(savedInstanceState)\\n navBack.setOnClickListener { onBackPressedSupport() }\\n setupEditor()\\n setupKeyboard()\\n onTextChanged(0)\\n setupViewPager()\\n\\n val status = arguments!!.getParcelable(\\"status\\")\\n fullscreenButton.setOnClickListener {\\n (activity as AbstractActivity).start(EditorView.newInstance(status.id,\\n replyEt.mentionsText,\\n StatusCreation.REPOST_STATUS))\\n replyEt.text = null\\n }\\n }\\n\\n private fun setupViewPager() {\\n val status = arguments!!.getParcelable(\\"status\\")\\n val bundle = arguments!!.getBundle(\\"photoExtra\\")\\n val fragments: List = if (findChildFragment(TimelineView::class.java) == null) {\\n val mentionView = MentionListView()\\n mentionView.onItemClickListener = onSuggestionSelectListener\\n mutableListOf(TimelineView.contextTimeline(TIMELINE_CONTEXT, status, bundle), mentionView)\\n } else {\\n mutableListOf(findChildFragment(TimelineView::class.java), MentionListView())\\n }\\n\\n viewPager.setPagingEnabled(false)\\n viewPager.offscreenPageLimit = 1\\n viewPager.adapter = object : FragmentPagerAdapter(childFragmentManager) {\\n override fun getItem(position: Int) = fragments[position]\\n\\n override fun getCount() = fragments.size\\n }\\n }\\n\\n private var keyboardListener: ViewTreeObserver.OnGlobalLayoutListener? = null\\n\\n private fun setupKeyboard() {\\n keyboardListener = KeyboardUtil.attach(activity, panelRoot, {\\n // TODO: how comes the Exception: panelRootContainer must not be null\\n panelRootContainer?.visibility =\\n if (it) {\\n if (replyEt.requestFocus()) replyEt.setSelection(replyEt.text.length)\\n View.VISIBLE\\n } else {\\n replyEt.clearFocus()\\n View.GONE\\n }\\n })\\n }\\n\\n private val config = WordTokenizerConfig.Builder()\\n .setExplicitChars(\\"@\\")\\n .setThreshold(3)\\n .setMaxNumKeywords(5)\\n .setWordBreakChars(\\" \\").build()\\n\\n private fun setupEditor() {\\n replyEt.tokenizer = WordTokenizer(config)\\n replyEt.setAvoidPrefixOnTap(true)\\n replyEt.setQueryTokenReceiver(this)\\n replyEt.setSuggestionsVisibilityManager(this)\\n replyEt.setAvoidPrefixOnTap(true)\\n\\n replyCommitButton.setOnClickListener { }\\n\\n if (Build.VERSION.SDK_INT >= Build.VERSION_CODES.O)\\n textCountProgress.min = 0\\n textCountProgress.max = STATUS_LIMIT\\n replyEt.addTextChangedListener(object : TextWatcher {\\n override fun afterTextChanged(s: Editable?) {\\n onTextChanged(s?.length ?: 0)\\n }\\n\\n override fun beforeTextChanged(s: CharSequence?, start: Int, count: Int, after: Int) {\\n\\n }\\n\\n override fun onTextChanged(s: CharSequence?, start: Int, before: Int, count: Int) {\\n\\n }\\n })\\n }\\n\\n\\n /**\\n * @param count \\u5b57\\u6570\\n */\\n private fun onTextChanged(count: Int) {\\n textCountProgress.progress = count\\n replyCommitButton.isEnabled = count in 1..STATUS_LIMIT\\n }\\n\\n\\n private val onSuggestionSelectListener = object : MentionListView.OnItemClickListener {\\n override fun onItemClick(position: Int, item: Suggestible) {\\n (item as Player).let {\\n replyEt.insertMention(it)\\n displaySuggestions(false)\\n playerViewModel.updateMentionedAt(it) //\\n onTextChanged(replyEt.text.length)\\n replyEt.requestFocus()\\n replyEt.setSelection(replyEt.text.length)\\n }\\n }\\n }\\n\\n @Suppress(\\"PrivatePropertyName\\")\\n private val BUCKET = \\"player-mentioned\\"\\n\\n override fun onQueryReceived(queryToken: QueryToken): MutableList {\\n val data = playerViewModel.filter(queryToken.keywords)\\n onReceiveSuggestionsResult(SuggestionsResult(queryToken, data), BUCKET)\\n return arrayOf(BUCKET).toMutableList()\\n }\\n\\n override fun onReceiveSuggestionsResult(result: SuggestionsResult, bucket: String) {\\n val data = result.suggestions\\n if (data?.isEmpty() != false) return\\n displaySuggestions(true)\\n findChildFragment(MentionListView::class.java).setData(data)\\n }\\n\\n override fun displaySuggestions(display: Boolean) {\\n viewPager.setCurrentItem(if (display) 1 else 0, true)\\n }\\n\\n override fun isDisplayingSuggestions() = viewPager.currentItem == 1\\n\\n override fun onBackPressedSupport(): Boolean {\\n when {\\n panelRootContainer.visibility == View.VISIBLE -> KeyboardUtil.hideKeyboard(panelRootContainer)\\n isDisplayingSuggestions -> displaySuggestions(false)\\n else -> pop()\\n }\\n return true\\n\\n }\\n\\n override fun onDestroy() {\\n keyboardListener?.let { KeyboardUtil.detach(activity, it) }\\n activity?.currentFocus?.let { KeyboardUtil.hideKeyboard(it) }\\n super.onDestroy()\\n }\\n\\n}",\n "license": "mit"\n },\n "truncated_cells": []\n }\n]', + 'NEGATIVE: DATASET_NAME: "vikp/starcoder_cleaned"\nFEATURES: {\'code\': {\'dtype\': \'string\', \'_type\': \'Value\'}, \'repo_path\': {\'dtype\': \'string\', \'_type\': \'Value\'}}\nDATA SAMPLE:\n[\n {\n "row_idx": 0,\n "row": {\n "code": "# ---\\n# jupyter:\\n# jupytext:\\n# text_representation:\\n# extension: .py\\n# format_name: light\\n# format_version: \'1.5\'\\n# jupytext_version: 1.14.4\\n# kernelspec:\\n# display_name: Python 3\\n# language: python\\n# name: python3\\n# ---\\n\\n# # 09 Strain Gage\\n#\\n# This is one of the most commonly used sensor. It is used in many transducers. Its fundamental operating principle is fairly easy to understand and it will be the purpose of this lecture. \\n#\\n# A strain gage is essentially a thin wire that is wrapped on film of plastic. \\n# \\n# The strain gage is then mounted (glued) on the part for which the strain must be measured. \\n# \\n#\\n# ## Stress, Strain\\n# When a beam is under axial load, the axial stress, $\\\\sigma_a$, is defined as:\\n# \\\\begin{align*}\\n# \\\\sigma_a = \\\\frac{F}{A}\\n# \\\\end{align*}\\n# with $F$ the axial load, and $A$ the cross sectional area of the beam under axial load.\\n#\\n# \\n#\\n# Under the load, the beam of length $L$ will extend by $dL$, giving rise to the definition of strain, $\\\\epsilon_a$:\\n# \\\\begin{align*}\\n# \\\\epsilon_a = \\\\frac{dL}{L}\\n# \\\\end{align*}\\n# The beam will also contract laterally: the cross sectional area is reduced by $dA$. This results in a transverval strain $\\\\epsilon_t$. The transversal and axial strains are related by the Poisson\'s ratio:\\n# \\\\begin{align*}\\n# \\\\nu = - \\\\frac{\\\\epsilon_t }{\\\\epsilon_a}\\n# \\\\end{align*}\\n# For a metal the Poission\'s ratio is typically $\\\\nu = 0.3$, for an incompressible material, such as rubber (or water), $\\\\nu = 0.5$.\\n#\\n# Within the elastic limit, the axial stress and axial strain are related through Hooke\'s law by the Young\'s modulus, $E$:\\n# \\\\begin{align*}\\n# \\\\sigma_a = E \\\\epsilon_a\\n# \\\\end{align*}\\n#\\n# \\n\\n# ## Resistance of a wire\\n#\\n# The electrical resistance of a wire $R$ is related to its physical properties (the electrical resistiviy, $\\\\rho$ in $\\\\Omega$/m) and its geometry: length $L$ and cross sectional area $A$.\\n#\\n# \\\\begin{align*}\\n# R = \\\\frac{\\\\rho L}{A}\\n# \\\\end{align*}\\n#\\n# Mathematically, the change in wire dimension will result inchange in its electrical resistance. This can be derived from first principle:\\n# \\\\begin{align}\\n# \\\\frac{dR}{R} = \\\\frac{d\\\\rho}{\\\\rho} + \\\\frac{dL}{L} - \\\\frac{dA}{A}\\n# \\\\end{align}\\n# If the wire has a square cross section, then:\\n# \\\\begin{align*}\\n# A & = L\'^2 \\\\\\\\\\n# \\\\frac{dA}{A} & = \\\\frac{d(L\'^2)}{L\'^2} = \\\\frac{2L\'dL\'}{L\'^2} = 2 \\\\frac{dL\'}{L\'}\\n# \\\\end{align*}\\n# We have related the change in cross sectional area to the transversal strain.\\n# \\\\begin{align*}\\n# \\\\epsilon_t = \\\\frac{dL\'}{L\'}\\n# \\\\end{align*}\\n# Using the Poisson\'s ratio, we can relate then relate the change in cross-sectional area ($dA/A$) to axial strain $\\\\epsilon_a = dL/L$.\\n# \\\\begin{align*}\\n# \\\\epsilon_t &= - \\\\nu \\\\epsilon_a \\\\\\\\\\n# \\\\frac{dL\'}{L\'} &= - \\\\nu \\\\frac{dL}{L} \\\\; \\\\text{or}\\\\\\\\\\n# \\\\frac{dA}{A} & = 2\\\\frac{dL\'}{L\'} = -2 \\\\nu \\\\frac{dL}{L}\\n# \\\\end{align*}\\n# Finally we can substitute express $dA/A$ in eq. for $dR/R$ and relate change in resistance to change of wire geometry, remembering that for a metal $\\\\nu =0.3$:\\n# \\\\begin{align}\\n# \\\\frac{dR}{R} & = \\\\frac{d\\\\rho}{\\\\rho} + \\\\frac{dL}{L} - \\\\frac{dA}{A} \\\\\\\\\\n# & = \\\\frac{d\\\\rho}{\\\\rho} + \\\\frac{dL}{L} - (-2\\\\nu \\\\frac{dL}{L}) \\\\\\\\\\n# & = \\\\frac{d\\\\rho}{\\\\rho} + 1.6 \\\\frac{dL}{L} = \\\\frac{d\\\\rho}{\\\\rho} + 1.6 \\\\epsilon_a\\n# \\\\end{align}\\n# It also happens that for most metals, the resistivity increases with axial strain. In general, one can then related the change in resistance to axial strain by defining the strain gage factor:\\n# \\\\begin{align}\\n# S = 1.6 + \\\\frac{d\\\\rho}{\\\\rho}\\\\cdot \\\\frac{1}{\\\\epsilon_a}\\n# \\\\end{align}\\n# and finally, we have:\\n# \\\\begin{align*}\\n# \\\\frac{dR}{R} = S \\\\epsilon_a\\n# \\\\end{align*}\\n# $S$ is materials dependent and is typically equal to 2.0 for most commercially availabe strain gages. It is dimensionless.\\n#\\n# Strain gages are made of thin wire that is wraped in several loops, effectively increasing the length of the wire and therefore the sensitivity of the sensor.\\n#\\n# _Question:\\n#\\n# Explain why a longer wire is necessary to increase the sensitivity of the sensor_.\\n#\\n# Most commercially available strain gages have a nominal resistance (resistance under no load, $R_{ini}$) of 120 or 350 $\\\\Omega$.\\n#\\n# Within the elastic regime, strain is typically within the range $10^{-6} - 10^{-3}$, in fact strain is expressed in unit of microstrain, with a 1 microstrain = $10^{-6}$. Therefore, changes in resistances will be of the same order. If one were to measure resistances, we will need a dynamic range of 120 dB, whih is typically very expensive. Instead, one uses the Wheatstone bridge to transform the change in resistance to a voltage, which is easier to measure and does not require such a large dynamic range.\\n\\n# ## Wheatstone bridge:\\n# \\n#\\n# The output voltage is related to the difference in resistances in the bridge:\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s} = \\\\frac{R_1R_3-R_2R_4}{(R_1+R_4)(R_2+R_3)}\\n# \\\\end{align*}\\n#\\n# If the bridge is balanced, then $V_o = 0$, it implies: $R_1/R_2 = R_4/R_3$.\\n#\\n# In practice, finding a set of resistors that balances the bridge is challenging, and a potentiometer is used as one of the resistances to do minor adjustement to balance the bridge. If one did not do the adjustement (ie if we did not zero the bridge) then all the measurement will have an offset or bias that could be removed in a post-processing phase, as long as the bias stayed constant.\\n#\\n# If each resistance $R_i$ is made to vary slightly around its initial value, ie $R_i = R_{i,ini} + dR_i$. For simplicity, we will assume that the initial value of the four resistances are equal, ie $R_{1,ini} = R_{2,ini} = R_{3,ini} = R_{4,ini} = R_{ini}$. This implies that the bridge was initially balanced, then the output voltage would be:\\n#\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s} = \\\\frac{1}{4} \\\\left( \\\\frac{dR_1}{R_{ini}} - \\\\frac{dR_2}{R_{ini}} + \\\\frac{dR_3}{R_{ini}} - \\\\frac{dR_4}{R_{ini}} \\\\right)\\n# \\\\end{align*}\\n#\\n# Note here that the changes in $R_1$ and $R_3$ have a positive effect on $V_o$, while the changes in $R_2$ and $R_4$ have a negative effect on $V_o$. In practice, this means that is a beam is a in tension, then a strain gage mounted on the branch 1 or 3 of the Wheatstone bridge will produce a positive voltage, while a strain gage mounted on branch 2 or 4 will produce a negative voltage. One takes advantage of this to increase sensitivity to measure strain.\\n#\\n# ### Quarter bridge\\n# One uses only one quarter of the bridge, ie strain gages are only mounted on one branch of the bridge.\\n#\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s} = \\\\pm \\\\frac{1}{4} \\\\epsilon_a S\\n# \\\\end{align*}\\n# Sensitivity, $G$:\\n# \\\\begin{align*}\\n# G = \\\\frac{V_o}{\\\\epsilon_a} = \\\\pm \\\\frac{1}{4}S V_s\\n# \\\\end{align*}\\n#\\n#\\n# ### Half bridge\\n# One uses half of the bridge, ie strain gages are mounted on two branches of the bridge.\\n#\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s} = \\\\pm \\\\frac{1}{2} \\\\epsilon_a S\\n# \\\\end{align*}\\n#\\n# ### Full bridge\\n#\\n# One uses of the branches of the bridge, ie strain gages are mounted on each branch.\\n#\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s} = \\\\pm \\\\epsilon_a S\\n# \\\\end{align*}\\n#\\n# Therefore, as we increase the order of bridge, the sensitivity of the instrument increases. However, one should be carefull how we mount the strain gages as to not cancel out their measurement.\\n\\n# _Exercise_\\n#\\n# 1- Wheatstone bridge\\n#\\n# \\n#\\n# > How important is it to know \\\\& match the resistances of the resistors you employ to create your bridge?\\n# > How would you do that practically?\\n# > Assume $R_1=120\\\\,\\\\Omega$, $R_2=120\\\\,\\\\Omega$, $R_3=120\\\\,\\\\Omega$, $R_4=110\\\\,\\\\Omega$, $V_s=5.00\\\\,\\\\text{V}$. What is $V_\\\\circ$?\\n\\nVs = 5.00\\nVo = (120**2-120*110)/(230*240) * Vs\\nprint(\'Vo = \',Vo, \' V\')\\n\\n# typical range in strain a strain gauge can measure\\n# 1 -1000 micro-Strain\\nAxialStrain = 1000*10**(-6) # axial strain\\nStrainGageFactor = 2\\nR_ini = 120 # Ohm\\nR_1 = R_ini+R_ini*StrainGageFactor*AxialStrain\\nprint(R_1)\\nVo = (120**2-120*(R_1))/((120+R_1)*240) * Vs\\nprint(\'Vo = \', Vo, \' V\')\\n\\n# > How important is it to know \\\\& match the resistances of the resistors you employ to create your bridge?\\n# > How would you do that practically?\\n# > Assume $R_1= R_2 =R_3=120\\\\,\\\\Omega$, $R_4=120.01\\\\,\\\\Omega$, $V_s=5.00\\\\,\\\\text{V}$. What is $V_\\\\circ$?\\n\\nVs = 5.00\\nVo = (120**2-120*120.01)/(240.01*240) * Vs\\nprint(Vo)\\n\\n# 2- Strain gage 1:\\n#\\n# One measures the strain on a bridge steel beam. The modulus of elasticity is $E=190$ GPa. Only one strain gage is mounted on the bottom of the beam; the strain gage factor is $S=2.02$.\\n#\\n# > a) What kind of electronic circuit will you use? Draw a sketch of it.\\n#\\n# > b) Assume all your resistors including the unloaded strain gage are balanced and measure $120\\\\,\\\\Omega$, and that the strain gage is at location $R_2$. The supply voltage is $5.00\\\\,\\\\text{VDC}$. Will $V_\\\\circ$ be positive or negative when a downward load is added?\\n\\n# In practice, we cannot have all resistances = 120 $\\\\Omega$. at zero load, the bridge will be unbalanced (show $V_o \\\\neq 0$). How could we balance our bridge?\\n#\\n# Use a potentiometer to balance bridge, for the load cell, we \'\'zero\'\' the instrument.\\n#\\n# Other option to zero-out our instrument? Take data at zero-load, record the voltage, $V_{o,noload}$. Substract $V_{o,noload}$ to my data.\\n\\n# > c) For a loading in which $V_\\\\circ = -1.25\\\\,\\\\text{mV}$, calculate the strain $\\\\epsilon_a$ in units of microstrain.\\n\\n# \\\\begin{align*}\\n# \\\\frac{V_o}{V_s} & = - \\\\frac{1}{4} \\\\epsilon_a S\\\\\\\\\\n# \\\\epsilon_a & = -\\\\frac{4}{S} \\\\frac{V_o}{V_s}\\n# \\\\end{align*}\\n\\nS = 2.02\\nVo = -0.00125\\nVs = 5\\neps_a = -1*(4/S)*(Vo/Vs)\\nprint(eps_a)\\n\\n# > d) Calculate the axial stress (in MPa) in the beam under this load.\\n\\n\\n\\n# > e) You now want more sensitivity in your measurement, you install a second strain gage on to\\n\\n# p of the beam. Which resistor should you use for this second active strain gage?\\n#\\n# > f) With this new setup and the same applied load than previously, what should be the output voltage?\\n\\n# 3- Strain Gage with Long Lead Wires \\n#\\n# \\n#\\n# A quarter bridge strain gage Wheatstone bridge circuit is constructed with $120\\\\,\\\\Omega$ resistors and a $120\\\\,\\\\Omega$ strain gage. For this practical application, the strain gage is located very far away form the DAQ station and the lead wires to the strain gage are $10\\\\,\\\\text{m}$ long and the lead wire have a resistance of $0.080\\\\,\\\\Omega/\\\\text{m}$. The lead wire resistance can lead to problems since $R_{lead}$ changes with temperature.\\n#\\n# > Design a modified circuit that will cancel out the effect of the lead wires.\\n\\n# ## Homework\\n#\\n",\n "repo_path": "Lectures/09_StrainGage.ipynb"\n },\n "truncated_cells": []\n },\n {\n "row_idx": 1,\n "row": {\n "code": "# ---\\n# jupyter:\\n# jupytext:\\n# split_at_heading: true\\n# text_representation:\\n# extension: .py\\n# format_name: light\\n# format_version: \'1.5\'\\n# jupytext_version: 1.14.4\\n# kernelspec:\\n# display_name: Python 3\\n# language: python\\n# name: python3\\n# ---\\n\\n#export\\nfrom fastai.basics import *\\nfrom fastai.tabular.core import *\\nfrom fastai.tabular.model import *\\n\\nfrom fastai.tabular.data import *\\n\\n#hide\\nfrom nbdev.showdoc import *\\n\\n\\n# +\\n#default_exp tabular.learner\\n# -\\n\\n# # Tabular learner\\n#\\n# > The function to immediately get a `Learner` ready to train for tabular data\\n\\n# The main function you probably want to use in this module is `tabular_learner`. It will automatically create a `TabulaModel` suitable for your data and infer the irght loss function. See the [tabular tutorial](http://docs.fast.ai/tutorial.tabular) for an example of use in context.\\n\\n# ## Main functions\\n\\n#export\\n@log_args(but_as=Learner.__init__)\\nclass TabularLearner(Learner):\\n \\"`Learner` for tabular data\\"\\n def predict(self, row):\\n tst_to = self.dls.valid_ds.new(pd.DataFrame(row).T)\\n tst_to.process()\\n tst_to.conts = tst_to.conts.astype(np.float32)\\n dl = self.dls.valid.new(tst_to)\\n inp,preds,_,dec_preds = self.get_preds(dl=dl, with_input=True, with_decoded=True)\\n i = getattr(self.dls, \'n_inp\', -1)\\n b = (*tuplify(inp),*tuplify(dec_preds))\\n full_dec = self.dls.decode((*tuplify(inp),*tuplify(dec_preds)))\\n return full_dec,dec_preds[0],preds[0]\\n\\n\\nshow_doc(TabularLearner, title_level=3)\\n\\n\\n# It works exactly as a normal `Learner`, the only difference is that it implements a `predict` method specific to work on a row of data.\\n\\n#export\\n@log_args(to_return=True, but_as=Learner.__init__)\\n@delegates(Learner.__init__)\\ndef tabular_learner(dls, layers=None, emb_szs=None, config=None, n_out=None, y_range=None, **kwargs):\\n \\"Get a `Learner` using `dls`, with `metrics`, including a `TabularModel` created using the remaining params.\\"\\n if config is None: config = tabular_config()\\n if layers is None: layers = [200,100]\\n to = dls.train_ds\\n emb_szs = get_emb_sz(dls.train_ds, {} if emb_szs is None else emb_szs)\\n if n_out is None: n_out = get_c(dls)\\n assert n_out, \\"`n_out` is not defined, and could not be infered from data, set `dls.c` or pass `n_out`\\"\\n if y_range is None and \'y_range\' in config: y_range = config.pop(\'y_range\')\\n model = TabularModel(emb_szs, len(dls.cont_names), n_out, layers, y_range=y_range, **config)\\n return TabularLearner(dls, model, **kwargs)\\n\\n\\n# If your data was built with fastai, you probably won\'t need to pass anything to `emb_szs` unless you want to change the default of the library (produced by `get_emb_sz`), same for `n_out` which should be automatically inferred. `layers` will default to `[200,100]` and is passed to `TabularModel` along with the `config`.\\n#\\n# Use `tabular_config` to create a `config` and cusotmize the model used. There is just easy access to `y_range` because this argument is often used.\\n#\\n# All the other arguments are passed to `Learner`.\\n\\npath = untar_data(URLs.ADULT_SAMPLE)\\ndf = pd.read_csv(path/\'adult.csv\')\\ncat_names = [\'workclass\', \'education\', \'marital-status\', \'occupation\', \'relationship\', \'race\']\\ncont_names = [\'age\', \'fnlwgt\', \'education-num\']\\nprocs = [Categorify, FillMissing, Normalize]\\ndls = TabularDataLoaders.from_df(df, path, procs=procs, cat_names=cat_names, cont_names=cont_names, \\n y_names=\\"salary\\", valid_idx=list(range(800,1000)), bs=64)\\nlearn = tabular_learner(dls)\\n\\n#hide\\ntst = learn.predict(df.iloc[0])\\n\\n# +\\n#hide\\n#test y_range is passed\\nlearn = tabular_learner(dls, y_range=(0,32))\\nassert isinstance(learn.model.layers[-1], SigmoidRange)\\ntest_eq(learn.model.layers[-1].low, 0)\\ntest_eq(learn.model.layers[-1].high, 32)\\n\\nlearn = tabular_learner(dls, config = tabular_config(y_range=(0,32)))\\nassert isinstance(learn.model.layers[-1], SigmoidRange)\\ntest_eq(learn.model.layers[-1].low, 0)\\ntest_eq(learn.model.layers[-1].high, 32)\\n\\n\\n# -\\n\\n#export\\n@typedispatch\\ndef show_results(x:Tabular, y:Tabular, samples, outs, ctxs=None, max_n=10, **kwargs):\\n df = x.all_cols[:max_n]\\n for n in x.y_names: df[n+\'_pred\'] = y[n][:max_n].values\\n display_df(df)\\n\\n\\n# ## Export -\\n\\n#hide\\nfrom nbdev.export import notebook2script\\nnotebook2script()\\n\\n\\n",\n "repo_path": "nbs/43_tabular.learner.ipynb"\n },\n "truncated_cells": []\n }\n]', +] +embeddings = model.encode(sentences) +print(embeddings.shape) +# [3, 768] + +# Get the similarity scores for the embeddings +similarities = model.similarity(embeddings, embeddings) +print(similarities.shape) +# [3, 3] +``` + + + + + + + +## Evaluation + +### Metrics + +#### Triplet + +* Evaluated with [TripletEvaluator](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.TripletEvaluator) + +| Metric | Value | +|:-------------------|:--------| +| cosine_accuracy | 1.0 | +| dot_accuracy | 0.0 | +| manhattan_accuracy | 1.0 | +| euclidean_accuracy | 1.0 | +| **max_accuracy** | **1.0** | + + + + + +## Training Details + +### Training Dataset + +#### query-to-dataset-viewer-descriptions + +* Dataset: [query-to-dataset-viewer-descriptions](https://huggingface.co/datasets/davanstrien/query-to-dataset-viewer-descriptions) +* Size: 1,141 training samples +* Columns: query, positive, and negative +* Approximate statistics based on the first 1000 samples: + | | query | positive | negative | + |:--------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------| + | type | string | string | string | + | details |
  • min: 9 tokens
  • mean: 11.72 tokens
  • max: 19 tokens
|
  • min: 40 tokens
  • mean: 2018.88 tokens
  • max: 8192 tokens
|
  • min: 41 tokens
  • mean: 2125.25 tokens
  • max: 8192 tokens
| +* Samples: + | query | positive | negative | + |:------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| + | USER_QUERY: LLM paper dataset | HUB_DATASET_PREVIEW: DATASET_NAME: "MarkrAI/AutoRAG-evaluation-2024-LLM-paper-v1"
FEATURES: {'doc_id': {'dtype': 'string', '_type': 'Value'}, 'contents': {'dtype': 'string', '_type': 'Value'}, 'metadata': {'creation_datetime': {'dtype': 'string', '_type': 'Value'}, 'file_name': {'dtype': 'string', '_type': 'Value'}, 'file_path': {'dtype': 'string', '_type': 'Value'}, 'file_size': {'dtype': 'int64', '_type': 'Value'}, 'file_type': {'dtype': 'null', '_type': 'Value'}, 'last_accessed_datetime': {'dtype': 'string', '_type': 'Value'}, 'last_modified_datetime': {'dtype': 'string', '_type': 'Value'}}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"doc_id": "6f86094c-47fe-43de-a77a-e8c34c69c997",
"contents": "# Rag-Driver: Generalisable Driving Explanations With Retrieval-Augmented In-Context Learning In Multi-Modal Large Language Model\n\nJianhao Yuan1, Shuyang Sun1, Daniel Omeiza1, Bo Zhao2, Paul Newman1, Lars Kunze1, Matthew Gadd1\n1 University of Oxford 2 Beijing Academy of Artificial Intelligence\n{jianhaoyuan,kevinsun,daniel,pnewman,lars,mattgadd}@robots.ox.ac.uk \nAbstract\u2014Robots powered by 'blackbox' models need to provide\nhuman-understandable explanations which we can trust. Hence,\nexplainability plays a critical role in trustworthy autonomous\ndecision-making to foster transparency and acceptance among\nend users, especially in complex autonomous driving. Recent\nadvancements in Multi-Modal Large Language models (MLLMs)\nhave shown promising potential in enhancing the explainability\nas a driving agent by producing control predictions along with\nnatural language explanations. However, severe data scarcity\ndue to expensive annotation costs and significant domain gaps\nbetween different datasets makes the development of a robust and\ngeneralisable system an extremely challenging task. Moreover, the\nprohibitively expensive training requirements of MLLM and the\nunsolved problem of catastrophic forgetting further limit their\ngeneralisability post-deployment. To address these challenges, we\npresent RAG-Driver, a novel retrieval-augmented multi-modal\nlarge language model that leverages in-context learning for high-\nperformance, explainable, and generalisable autonomous driving.\nBy grounding in retrieved expert demonstration, we empirically\nvalidate that RAG-Driver achieves state-of-the-art performance in\nproducing driving action explanations, justifications, and control\nsignal prediction. More importantly, it exhibits exceptional zero-\nshot generalisation capabilities to unseen environments without \nfurther training endeavours1.\nIndex Terms\u2014Autonomous driving, multi-modal language\nmodel, end-to-end driving, domain generalisation",
"metadata": {
"creation_datetime": "2024-03-04",
"file_name": "2402.10828v1.md",
"file_path": "paper_data/2402.10828v1.md",
"file_size": 64885,
"file_type": null,
"last_accessed_datetime": "2024-03-04",
"last_modified_datetime": "2024-02-22"
}
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"doc_id": "cf485ad0-8ec4-4a63-a0c6-5d7eb499c0c8",
"contents": "# Rag-Driver: Generalisable Driving Explanations With Retrieval-Augmented In-Context Learning In Multi-Modal Large Language Model\n## I. Introduction\n\nDriven by the emerging development of deep learning, autonomous driving has observed a paradigm shift from rulesbased decision systems [66, 21] to data-driven learning-based approaches [28, 6, 36]. However, this comes at the cost of transparency in decision-making, especially for end-to-end autonomous driving systems which are considered black-box in nature [13]. Thus, in addition to precision in action control, explanation provision is key in ensuring trustworthy decisionmaking to reconcile the system's decisions with end-user expectations to foster confidence and acceptance [79, 8, 57] in dynamic driving environments. \nTraditional approaches have mainly relied on attention visualisation [5, 7, 55] as a proxy to rationalise the decisions of the black-box systems or auxiliary intermediate tasks such as semantic segmentation [25, 32], object detection [16, 31], and affordance prediction [68, 45] provide meaningful intermediate representation for decision-making. However, these methods do not engage end-users in the dialogue as they are onedirectional and not readily comprehensible by the general users for the purpose of fostering trust and confidence. An alternative promising approach is the integration of natural language explanations [38, 33, 54], in particular through Multi-Modal Large Language Models (MLLMs) [1, 70]. These models, pretrained on extensive web-scale datasets, demonstrate remarkable reasoning capacity, enabling the transformation of complex vehicular decision-making processes into more understandable narrative formats, thereby offering a new layer of explainability to conventional systems. \nWhile several early attempts have demonstrated the potential of MLLMs as general explainable driving agents [78, 76, 51], these methods fall short of human-level understanding. One of the limitations is their failure to generalise to unseen environments. A primary obstacle is the lack of high-quality annotated data [56], coupled with the significant domain shift across various datasets [23], which hinders the models' generalisation capacity to novel environments outside of the training data distribution. Another critical challenge is the prohibitively expensive training requirement and the unsolved problem of catastrophic forgetting [39], which make re-training or finetuning impractical solutions due to the immense computational demands and severe performance degradation. Consequently, this further limits the models' generalisability after deployment, as they struggle to effectively utilise new data in constantly evolving environments and driving scenarios. \nTo address these challenges, we introduce *RAG-Driver*, a novel retrieval-augment",
"metadata": {
"creation_datetime": "2024-03-04",
"file_name": "2402.10828v1.md",
"file_path": "paper_data/2402.10828v1.md",
"file_size": 64885,
"file_type": null,
"last_accessed_datetime": "2024-03-04",
"last_modified_datetime": "2024-02-22"
}
},
"truncated_cells": []
}
]
| NEGATIVE: DATASET_NAME: "emozilla/dolma-v1_7-arxiv"
FEATURES: {'text': {'dtype': 'string', '_type': 'Value'}, 'id': {'dtype': 'string', '_type': 'Value'}, 'metadata': {'file_path': {'dtype': 'string', '_type': 'Value'}}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"text": "\\section{Introduction}\nLet $G$ be a simple undirected graph with the \\textit{vertex set} $V(G)$ and the \\textit{edge set} $E(G)$. A vertex with degree one is called a \\textit{pendant vertex}. The distance between the vertices $u$ and $v$ in graph $G$ is denoted by $d_G(u,v)$. A cycle $C$ is called \\textit{chordless} if $C$ has no \\textit{cycle chord} (that is an edge not in the edge set of $C$ whose endpoints lie on the vertices of $C$).\nThe \\textit{Induced subgraph} on vertex set $S$ is denoted by $\\langle S\\rangle$. A path that starts in $v$ and ends in $u$ is denoted by $\\stackrel\\frown{v u}$.\nA \\textit{traceable} graph is a graph that possesses a Hamiltonian path.\nIn a graph $G$, we say that a cycle $C$ is \\textit{formed by the path} $Q$ if $ | E(C) \\setminus E(Q) | = 1 $. So every vertex of $C$ belongs to $V(Q)$.\n\nIn 2011 the following conjecture was proposed:\n\\begin{conjecture}(Hoffmann-Ostenhof \\cite{hoffman})\nLet $G$ be a connected cubic graph. Then $G$ has a decomposition into a spanning tree, a matching and a family of cycles.\n\n\\end{conjecture}\nConjecture \\theconjecture$\\,$ also appears in Problem 516 \\cite{cameron}. There are a few partial results known for Conjecture \\theconjecture. Kostochka \\cite{kostocha} noticed that the Petersen graph, the prisms over cycles, and many other graphs have a decomposition desired in Conjecture \\theconjecture. Ozeki and Ye \\cite{ozeki} proved that the conjecture holds for 3-connected cubic plane graphs. Furthermore, it was proved by Bachstein \\cite{bachstein} that Conjecture \\theconjecture$\\,$ is true for every 3-connected cubic graph embedded in torus or Klein-bottle. Akbari, Jensen and Siggers \\cite[Theorem 9]{akbari} showed that Conjecture \\theconjecture$\\,$ is true for Hamiltonian cubic graphs.\n\nIn this paper, we show that Conjecture \\theconjecture$\\,$ holds for traceable cubic graphs.\n\\section{Results}\nBefore proving the main result, we need the following lemma.\n\\begin{lemma}\n\\label{lemma:1}\nLet $G$ be a cubic graph. Suppose that $V(G)$ can be partitioned into a tree $T$ and finitely many cycles such that there is no edge between any pair of cycles (not necessarily distinct cycles), and every pendant vertex of $T$ is adjacent to at least one vertex of a cycle. Then, Conjecture \\theconjecture$\\,$ holds for $G$.\n\\end{lemma}\n\\begin{proof}\nBy assumption, every vertex of each cycle in the partition is adjacent to exactly one vertex of $T$. Call the set of all edges with one endpoint in a cycle and another endpoint in $T$ by $Q$.\nClearly, the induced subgraph on $E(T) \\cup Q$ is a spanning tree of $G$. We call it $T'$. Note that every edge between a pendant vertex of $T$ and the union of cycles in the partition is also contained in $T'$. Thus, every pendant vertex of $T'$ is contained in a cycle of the partition. Now, consider the graph $H = G \\setminus E(T')$. For every $v \\in V(T)$, $d_H(v) \\leq 1$. So Conjecture \\theconjecture$\\,$ holds for $G$. \\vspace{1em}\n\\end{proof}\n\n\n\\noindent\\textbf{Remark 1.}\n\\label{remark:1}\nLet $C$ be a cycle formed by the path $Q$. Then clearly there exists a chordless cycle formed by $Q$.\n\nNow, we are in a position to prove the main result.\n\n\\begin{theorem}\nConjecture \\theconjecture$\\,$ holds for traceable cubic graphs.\n\\end{theorem}\n\\begin{proof}\nLet $G$ be a traceable cubic graph and $P : v_1, \\dots, v_n$ be a Hamiltonian path in $G$. By \\cite[Theorem 9]{akbari}, Conjecture A holds for $v_1 v_n \\in E(G)$. Thus we can assume that $v_1 v_n \\notin E(G)$. Let $v_1 v_j, v_1 v_{j'}, v_i v_n, v_{i'} v_n \\in E(G)\\setminus E(P)$ and $j' < j < n$, $1 < i < i'$. Two cases can occur:\n\\begin{enumerate}[leftmargin=0pt,label=]\n\\item\n\\textbf{Case 1.}\nAssume that $i < j$. Consider the following graph in Figure \\ref{fig:overlapping} in which the thick edges denote the path $P$. Call the three paths between $v_j$ and $v_i$, from the left to the right, by $P_1$, $P_2$ and $P_3$, respectively (note that $P_1$ contains the edge $e'$ and $P_3$ contains the edge $e$).\n\n\\begin{figure}[H]\n \\begin{center}\n \\includegraphics[width=40mm]{engImages/overlapping.pdf}\n \\caption{Paths $P_1$, $P_2$ and $P_3$}\n \\label{fig:overlapping}\n \\end{center}\n\\end{figure}\n\n\nIf $P_2$ has order $2$, then $G$ is Hamiltonian and so by \\cite[Theorem 9]{akbari} Conjecture \\theconjecture$\\,$ holds. Thus we can assume that $P_1$, $P_2$ and $P_3$ have order at least $3$. Now, consider the following subcases:\\\\\n\n\\begin{enumerate}[leftmargin=0pt,label=]\n\\label{case:1}\n\\item \\textbf{Subcase 1.} There is no edge between $V(P_r)$ and $V(P_s)$ for $1 \\leq r < s \\leq 3$. Since every vertex of $P_i$ has degree 3 for every $i$, by \\hyperref[remark:1]{Remark 1}$\\,$ there are two chordless cycles $C_1$ and $C_2$ formed by $P_1$ and $P_2$, respectively.\nDefine a tree $T$ with the edge set\n$$ E\\Big(\\langle V(G) \\setminus \\big(V(C_1) \\cup V(C_2)\\big) \\rangle\\Big) \\bigcap \\big(\\bigcup_{i=1}^3 E(P_i)\\big).$$\nNow, apply \\hyperref[lemma:1]{Lemma 1} $\\,$for the partition $\\{T, C_1, C_2\\}$.\\\\\n\n\\item \\textbf{Subcase 2.}\n\\label{case:edge}\nThere exists at least one edge between some $P_r$ and $P_s$, $r i'$ such that $v_s v_t \\in E(G)$. By \\hyperref[remark:1]{Remark 1} $\\,$ there are two chordless cycles $C_1$ and $C_2$, respectively formed by the paths $v_1 v_{j'}$ and $v_{i'} v_n$. By assumption there is no edge $xy$, where $x \\in V(C_1)$ and $y \\in V(C_2)$.\nDefine a tree $T$ with the edge set:\n$$ E\\Big(\\langle V(G) \\setminus \\big(V(C_1) \\cup V(C_2)\\big) \\rangle \\Big) \\bigcap \\Big( E(P) \\cup \\{v_{i'}v_n, v_{j'}v_1\\} \\Big).$$\nNow, apply \\hyperref[lemma:1]{Lemma 1} $\\,$for the partition $\\{T, C_1, C_2\\}$.\\\\\n\n\\item \\textbf{Subcase 2.}\n\\label{subcase:22} There are at least four indices $s, s' < j$ and $t, t' > i$ such that $v_s v_t, v_{s'} v_{t'} \\in E(G)$. Choose four indices $g, h < j$ and $e, f > i$ such that $v_h v_e, v_g v_f \\in E(G)$ and $|g-h| + |e-f|$ is minimum.\n\n\\begin{figure}[H]\n \\begin{center}\n \\includegraphics[width=90mm]{engImages/case2-subcase2.pdf}\n \\caption{Two edges $v_h v_e$ and $v_g v_f$}\n \\label{fig:non-overlapping}\n \\end{center}\n\\end{figure}\n\nThree cases can be considered:\\\\\n\n\\begin{enumerate}[leftmargin=0pt,label=(\\alph*)]\n\\item There is no chordless cycle formed by $\\stackrel\\frown{v_g v_h}$ and by $\\stackrel\\frown{v_e v_f}$.\n\nConsider the cycle $\\stackrel\\frown{v_g v_h} \\stackrel\\frown{v_e v_f}v_g$ and call it $C$. Now, define a tree $T$ with the edge set,\n$$\\,\\,\\,E\\Big(\\langle V(G) \\setminus V(C)\\rangle \\Big) \\bigcap \\Big( E(P) \\cup \\{v_1v_{j}, v_{i}v_n\\} \\Big),$$\napply \\hyperref[lemma:1]{Lemma 1} $\\,$for the partition $\\{T, C\\}$.\\\\\n\n\\item With no loss of generality, there exists a chordless cycle formed by $\\stackrel\\frown{v_e v_f}$ and there is no chordless cycle formed by the path $\\stackrel\\frown{v_g v_h}$. First suppose that there is a chordless cycle $C_1$ formed by $\\stackrel\\frown{v_e v_f}$ such that there is no edge between $V(C_1)$ and $\\{v_1, \\dots, v_j\\}$. By \\hyperref[remark:1]{Remark 1} $,$ there exists a chordless cycle $C_2$ formed by $\\stackrel\\frown{v_1 v_j}$. By assumption there is no edge between $V(C_1)$ and $V(C_2)$. Now, define a tree $T$ with the edge set,\n\n$$\\quad\\quad\\quad\\quad E\\Big(\\langle V(G) \\setminus \\big(V(C_1) \\cup V(C_2)\\big)\\rangle \\Big) \\bigcap \\Big( E(P) \\cup \\{v_1v_{j}, v_{i}v_n\\} \\Big),$$\n\nand apply \\hyperref[lemma:1]{Lemma 1} $\\,$for the partition $\\{T, C_1, C_2\\}$.\n\n$\\;$ Next assume that for every cycle $C_r$ formed by $\\stackrel\\frown{v_e v_f}$, there are two vertices $x_r \\in V(C_r)$ and $y_r \\in \\{v_1, \\dots, v_j\\}$ such that $x_r y_r \\in E(G)$. Let $v_e=w_0, w_1, \\dots, w_l=v_f$ be all vertices of the path $\\stackrel\\frown{v_e v_f}$ in $P$. Choose the shortest path $w_0 w_{i_1} w_{i_2} \\dots w_l$ such that $0 < i_1 < i_2 < \\dots < l$. Consider the cycle $w_0 w_{i_1} \\dots w_l \\stackrel\\frown{v_g v_h}$ and call it $C$. Now, by removing $C$, $q$ vertex disjoint paths $Q_1, \\dots, Q_q$ which are contained in $\\stackrel\\frown{v_e v_f}$ remain. Note that there exists a path of order $2$ in $C$ which by adding this path to $Q_i$ we find a cycle $C_{r_i}$, for some $i$. Hence there exists an edge $x_{r_i} y_{r_i}$ connecting $Q_i$ to $V(G) \\setminus V(\\stackrel\\frown{v_e v_f})$. We define a tree $T$ whose edge set is the edges,\n$$\\quad\\quad\\quad\\quad\\quad\\quad E\\Big(\\langle V(G) \\setminus V(C)\\rangle \\Big) \\bigcap \\Big( E(P) \\cup \\{v_1v_{j}, v_{i}v_n\\} \\cup \\big\\{x_{r_i} y_{r_i} \\mid 1 \\leq i \\leq q\\big\\} \\Big),$$\nthen apply \\hyperref[lemma:1]{Lemma 1} $\\,$ on the partition $\\{T, C\\}$.\\\\\n\\begin{figure}[H]\n \\begin{center}\n \\includegraphics[width=90mm]{engImages/deltaNonOverlapping.pdf}\n \\caption{The tree $T$ and the shortest path $w_0 w_{i_1}\\dots w_l$}\n \\label{fig:delta-non-overlapping}\n \\end{center}\n\\end{figure}\n\n\\item There are at least two chordless cycles, say $C_1$ and $C_2$ formed by the paths $\\stackrel\\frown{v_g v_h}$ and $\\stackrel\\frown{v_e v_f}$, respectively. Since $|g-h| + |e-f|$ is minimum, there is no edge $xy \\in E(G)$ with $x \\in V(C_1)$ and $y \\in V(C_2)$. Now, define a tree $T$ with the edge set,\n$$\\quad\\quad\\quad\\quad E\\Big( \\langle V(G) \\setminus \\big(V(C_1) \\cup V(C_2)\\big) \\rangle \\Big) \\bigcap \\Big( E(P) \\cup \\{v_1 v_{j}, v_{i}v_n\\} \\Big),$$\nand apply \\hyperref[lemma:1]{Lemma 1} $\\,$for the partition $\\{T, C_1, C_2\\}$.\\\\\n\\end{enumerate}\n\n\\item \\textbf{Subcase 3.} There exist exactly two indices $s,t$, $s < j' < i' < t$ such that $v_s v_t \\in E(G)$ and there are no two other indices $s', t'$ such that $s' < j < i < t'$ and $v_{s'} v_{t'} \\in E(G)$. We can assume that there is no cycle formed by $\\stackrel\\frown{v_{s+1} v_j}$ or $\\stackrel\\frown{v_i v_{t-1}}$, to see this by symmetry consider a cycle $C$ formed by $\\stackrel\\frown{v_{s+1} v_j}$. By \\hyperref[remark:1]{Remark 1} $\\,$ there exist chordless cycles $C_1$ formed by $\\stackrel\\frown{v_{s+1} v_j}$ and $C_2$ formed by $\\stackrel\\frown{v_{i} v_n}$. By assumption $v_s v_t$ is the only edge such that $s < j$ and $t > i \\;$. Therefore, there is no edge between $V(C_1)$ and $V(C_2)$. Now, let $T$ be a tree defined by the edge set,\n$$ E\\Big(\\langle V(G) \\setminus \\big(V(C_1) \\cup V(C_2)\\big)\\rangle \\Big) \\bigcap \\Big( E(P) \\cup \\{v_1v_{j}, v_{i}v_n\\} \\Big),$$\nand apply \\hyperref[lemma:1]{Lemma 1} $\\,$for the partition \\{$T$, $C_1$, $C_2$\\}.\\\\\n\n$\\quad$Furthermore, we can also assume that either $s \\neq j'-1$ or $t \\neq i'+1$, otherwise we have the Hamiltonian cycle $\\stackrel\\frown{v_1 v_s} \\stackrel\\frown{v_t v_n} \\stackrel\\frown{v_{i'} v_{j'}} v_1$ and by \\cite[Theorem 9]{akbari} Conjecture \\theconjecture$\\,$ holds.\n\n$\\quad$By symmetry, suppose that $s \\neq j'-1$. Let $v_k$ be the vertex adjacent to $v_{j'-1}$, and $k \\notin \\{j'-2, j'\\}$. It can be shown that $k > j'-1$, since otherwise by considering the Hamiltonian path $P': \\; \\stackrel\\frown{ v_{k+1} v_{j'-1}}\\stackrel\\frown{v_k v_1} \\stackrel\\frown{v_{j'} v_n}$, the new $i'-j'$ is greater than the old one and this contradicts our assumption about $P$ in the \\hyperref[case:2]{Case 2}.\n\n$\\quad$We know that $j' < k < i$. Moreover, the fact that $\\stackrel\\frown{v_{s+1} v_j}$ does not form a cycle contradicts the case that $j' < k \\le j$. So $j < k < i$. Consider two cycles $C_1$ and $C_2$, respectively with the vertices $v_1 \\stackrel\\frown{v_{j'} v_{j}} v_1$ and $v_n \\stackrel\\frown{v_{i'} v_{i}} v_n$. The cycles $C_1$ and $C_2$ are chordless, otherwise there exist cycles formed by the paths $\\stackrel\\frown{v_{s+1} v_j}$ or $\\stackrel\\frown{v_i v_{t-1}}$. Now, define a tree $T$ with the edge set\n$$ E\\Big(\\langle V(G) \\setminus \\big(V(C_1) \\cup V(C_2)\\big)\\rangle \\Big) \\bigcap \\Big( E(P) \\cup \\{v_s v_t, v_k v_{j'-1}\\} \\Big),$$\nand apply \\hyperref[lemma:1]{Lemma 1} $\\,$for the partition \\{$T$, $C_1$, $C_2$\\}.\n\\end{enumerate}\n\\end{enumerate}\n\\end{proof}\n\n\\noindent\\textbf{Remark 2.}\n\\label{remark:2}\nIndeed, in the proof of the previous theorem we showed a stronger result, that is, for every traceable cubic graph there is a decomposition with at most two cycles.\n\n",
"id": "b7c40b41b7eedaa408f87d154284a1aba126589c",
"metadata": {
"file_path": "/home/ubuntu/dolma-v1_7/arxiv-0000.json.gz"
}
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"text": "\\section{Principle of nano strain-amplifier}\r\n\r\n\\begin{figure*}[t!]\r\n\t\\centering\r\n\t\\includegraphics[width=5.4in]{Fig1}\r\n\t\t\\vspace{-0.5em}\r\n\t\\caption{Schematic sketches of nanowire strain sensors. (a)(b) Conventional non-released and released NW structure; \r\n\t\t(c)(d) The proposed nano strain-amplifier and its simplified physical model.}\r\n\t\\label{fig:fig1}\r\n\t\t\\vspace{-1em}\r\n\\end{figure*}\r\nFigure \\ref{fig:fig1}(a) and 1(b) show the concept of the conventional structures of piezoresistive sensors. The piezoresistive elements are either released from, or kept on, the substrate. The sensitivity ($S$) of the sensors is defined based on the ratio of the relative resistance change ($\\Delta R/R$) of the sensing element and the strain applied to the substrate ($\\varepsilon_{sub}$):\r\n\\begin{equation}\r\nS = (\\Delta R/R)/\\varepsilon_{sub}\r\n\\label{eq:sensitivity}\r\n\\end{equation}\r\nIn addition, the relative resistance change $\\Delta R/R$ can be calculated from the gauge factor ($GF$) of the material used to make the piezoresistive elements: $\\Delta R/R = GF \\varepsilon_{ind}$, where $\\varepsilon_{ind}$ is the strain induced into the piezoresistor. In most of the conventional strain gauges as shown in Fig. \\ref{fig:fig1} (a,b), the thickness of the sensing layer is typically below a few hundred nanometers, which is much smaller than that of the substrate. Therefore, the strain induced into the piezoresistive elements is approximately the same as that of the substrate ($\\varepsilon_{ind} \\approx \\varepsilon_{sub}$). Consequently, to improve the sensitivity of strain sensors (e.g. enlarging $\\Delta R/R$), electrical approaches which can enlarge the gauge factor ($GF$) are required. Nevertheless, as aforementioned, the existence of the large gauge factor in nanowires due to quantum confinement or surface state, is still considered as controversial. \n\r\nIt is also evident from Eq. \\ref{eq:sensitivity} that the sensitivity of strain sensors can also be improved using a mechanical approach, which enlarges the strain induced into the piezoresistive element. Figure \\ref{fig:fig1}(c) shows our proposed nano strain-amplifier structure, in which the piezoresistive nanowires are locally fabricated at the centre of a released bridge. The key idea of this structure is that, under a certain strain applied to the substrate, a large strain will be concentrated at the locally fabricated SiC nanowires. The working principle of the nano strain-amplifier is similar to that of the well-known dogbone structure, which is widely used to characterize the tensile strength of materials \\cite{dogbone1,dogbone2}. That is, when a stress is applied to the dogbone-shape of a certain material, a crack, if generated, will occur at the middle part of the dogbone. The large strain concentrated at the narrow area located at the centre part with respect to the wider areas located at outer region, causes the crack. Qualitative and quantitative explanations of the nano strain-amplifier are presented as follows. \r\n\r\nFor the sake of simplicity, the released micro frame and nanowire (single wire or array) of the nano strain-amplifier can be considered as solid springs, Fig. \\ref{fig:fig1}(d). The stiffness of these springs are proportional to their width ($w$) and inversely proportional to their length (l): $K \\propto w/l$. Consequently, the model of the released nanowire and micro frames can be simplified as a series of springs, where the springs with higher stiffness correspond to the micro frame, and the single spring with lower stiffness corresponds to the nanowire. It is well-known in classical physics that, for serially connected springs, a larger strain will be concentrated in the low--stiffness string, while a smaller strain will be induced in the high--stiffness string \\cite{Springbook}. The following analysis quantitatively explained the amplification of the strain.\t\r\n\r\n\\begin{figure}[b!]\r\n\t\\centering\r\n\t\\includegraphics[width=3in]{Fig2}\r\n\t\\vspace{-1em}\r\n\t\\caption{Finite element analysis of the strain induced in to the nanowire array utilizing nano strain-amplifier.}\r\n\t\\label{fig:fig2}\r\n\\end{figure}\r\nWhen a tensile mechanical strain ($\\varepsilon_{sub}$) is applied to the substrate, the released structure will also be elongated. Since the stiffness of the released frame is much smaller than that of the substrate, it is safe to assume that the released structure will follows the elongation of the substrate. The displacement of the released structure $\\Delta L$ is:\r\n\\begin{equation}\r\n\\Delta L = \\Delta L_m + \\Delta L_n = L_m \\varepsilon_m + L_n \\varepsilon_n\r\n\\label{eq:displacement}\r\n\\end{equation} \r\nwhere $L_m$, $L_n$ are the length; $\\Delta L_m$, $\\Delta L_n$ are the displacement; and $\\varepsilon_m$, $\\varepsilon_n$ are the strains induced into the micro spring and nano spring, respectively. The subscripts m and n stand for the micro frames and nanowires, respectively. Furthermore, due to the equilibrium of the stressing force ($F$) along the series of springs, the following relationship is established: $F= K_m\\Delta L_m = K_n \\Delta L_n$, where $K_m$, $K_n$ are the stiffness of the released micro frames and nanowires, respectively. Consequently the relationship between the displacement of the micro frame (higher stiffness) and nanowires (lower stiffness) is:\r\n\\begin{equation}\r\n\\frac{\\Delta L_m}{\\Delta L_n}=\\frac{K_n}{K_m}=\\frac{L_mw_n}{L_nw_m}\r\n\\label{eq:euili}\r\n\\end{equation}\r\nSubstituting Eqn. \\ref{eq:euili} into Eqn. \\ref{eq:displacement}, the strain induced into the locally fabricated nanowires is:\r\n\\begin{equation}\r\n\\varepsilon_n = \\frac{\\Delta L_n}{L_n} = \\frac{1}{1-\\frac{w_m-w_n}{w_m}\\frac{L_m}{L}}\\varepsilon_{sub}\r\n\\label{eq:strainamp}\r\n\\end{equation} \r\n\r\nEquation \\ref{eq:strainamp} indicates that increasing the ratio of $w_m/w_n$ and $L_m/L_n$ significantly amplifies the strain induced into the nanowire from the strain applied to the substrate. This model is also applicable to the case of nanowire arrays, in which $w_n$ is the total width of all nanowires in the array.\n\r\nThe theoretical model is then verified using the finite element analysis (FEA). In the FEA simulation, we compare the strain induced into (i) non released nanowires, (ii) the conventionally released nanowires, and (iii) our nano strain-amplifier structure, using COMSOL Multiphysics \\texttrademark. In our nano strain amplifying structure, the width of the released frame was set to be 8 $\\mu$m, while the width of each nanowire in the array (3 wires) was set to be 370 nm. The nanowires array structure was selected as it can enhance the electrical conductance of the SiC nanowires resistor which makes the subsequent experimental demonstration easier. The ratio between the length of nanowires and micro bridge was set to be 1: 20. With this geometrical dimensions, strain induced into nanowires array $\\varepsilon_n$ was numerically calculated to be approximately 6 times larger than $\\varepsilon_{sub}$, Eqn. \\ref{eq:strainamp}. The simulation results show that for all structure, the elongation of non-released and released nanowires follow that of the substrate. In addition, strain was almost completely transferred into conventional released and non-released structures. Furthermore, the ratio of the strain induced in to the locally fabricated nanowires was estimated to be 5.9 times larger than that of the substrate, Fig. \\ref{fig:fig2}. These results are in solid agreement with the theoretical analysis presented above. For a nanowire array with an average width of 470 nm, the amplified gain of strain was found to be 4.5. \t\r\n\r\nBased on the theoretical analysis, we conducted the following experiments to demonstrate the high sensitivity of SiC nanowire strain sensors using the nano strain-amplifier. A thin 3C-SiC film with its thickness of 300 nm was epitaxially grown on a 150 mm diameter Si wafer using low pressure chemical vapour deposition \\cite{SiC_growth}. The film was \\emph{in situ} doped using Al dopants. The carrier concentration of the p-type 3C-SiC was found to be $5 \\times 10^{18}$ cm$^{-3}$, using a hot probe technique \\cite{philip}. The details of the characteristics of the grown film can be found elsewhere \\cite{Phan_JMC}. Subsequently, I-shape p-type SiC resistors with aluminum electrodes deposited on the surface were patterned using inductive coupled plasma (ICP) etching. As the piezoresistance of p-type 3C-SiC depends on crystallographic orientation, all SiC resistors of the present work were aligned along [110] direction to maximize the piezoresistive effect. Next, the micro scale SiC resistors were then released from the Si substrate using dry etching (XeF$_2$). Finally, SiC nanowire arrays were formed at the centre of the released bridge using focused ion beam (FIB). Two types of nanowire array were fabricated with three nanowires for each array. The average width of each nanowire in each type were 380 nm and 470 nm, respectively. Figure \\ref{fig:fig3} shows the SEM images of the fabricated samples, including the conventional released structure, non-released nanowires, and the nano strain-amplifier. \r\n\r\n\\begin{figure}[t!]\r\n\t\\centering\r\n\t\\includegraphics[width=3in]{Fig3}\r\n\t\\caption{SEM image of SiC strain sensors. (a) Released SiC micro bridge used for the subsequent fabrication of the nano strain-amplifier; (b) SEM of a micro SiC resistor where the SiC nanowires array were formed using FIB; (c) SEM of non-released SiC nanowires; (d) SEM of locally fabricated SiC nanowires released from the Si substrate (nano strain-amplifier).}\r\n\t\\label{fig:fig3}\r\n\t\\vspace{-1em}\r\n\\end{figure}\r\nThe current voltage (I-V) curves of all fabricated samples were characterized using a HP 4145 \\texttrademark ~parameter analyzer. The linear relationship between the applied voltage and measured current, indicated that Al made a good Ohmic contact with the highly doped SiC resistance, Fig. \\ref{fig:IV}. Additionally, the electrical conductivity of both nanowires and micro frame estimated from the I-V curve and the dimensions of the resistors shows almost the same value. This indicated that the FIB process did not cause a significant surface damage to the fabricated nanowires. \r\n\t\r\n\\begin{figure}[b!]\r\n\t\\centering\r\n\t\\includegraphics[width=3in]{Fig4}\r\n\t\t\\vspace{-1.5em}\r\n\t\\caption{Current voltage curves of the fabricated SiC resistors.}\r\n\t\\label{fig:IV}\r\n\n\\end{figure}\r\n\r\nThe bending experiment was used to characterize the piezoresistive effect in micro size SiC resistors and locally fabricated SiC nanowire array. In this experiment one end of the Si cantilever (with a thickness of 625 $\\mu$m, and a width of 7 mm) was fixed while the other end was deflected by applying different forces. The distance from the fabricated nanowires to the free end of the Si cantilever was approximately 45 mm. The strain induced into the Si substrate is $\\varepsilon_\\text{sub} = Mt/2EI$, where $M$ is the applied bending moment; and $t$, $E$ and $I$ are the thickness, Young's modulus and the moment of inertia of the Si cantilever, respectively. The response of the SiC resistance to applied strain was then measured using a multimeter (Agilent \\texttrademark 34401 A).\n\r\n\\begin{figure}[h!]\r\n\t\\centering\r\n\t\\includegraphics[width=3in]{Fig5.eps}\r\n\t\t\\vspace{-1.5em}\r\n\t\\caption{Experimental results. (a) A comparision between the relative resistance change in the nano strain-amplifiers, non released nanowires and released micro frames; (b) The repeatability of the SiC nanowires strain sensors utilizing the proposed structure.}\r\n\t\\label{fig:DRR}\r\n\t\t\t\\vspace{-1em}\r\n\\end{figure}\t\r\nThe relative resistance change ($\\Delta R/R$) of the micro and nano SiC resistors was plotted against the strain induced into the Si substrate $\\varepsilon_{sub}$, Fig. \\ref{fig:DRR}(a). For all fabricated samples, the relative resistance change shows a good linear relationship with the applied strain ($\\varepsilon_{sub}$). In addition, with the same applied strain to the Si substrate, the resistance change of the SiC nanowires using the nano strain-amplifier was much larger than that of the the SiC micro resistor and the conventional non-released SiC nanowires. In addition, reducing the width of the SiC nanowires also resulted in the increase of the sensitivity. The magnitude of the piezoresistive effect in the nano strain-amplifier as well as conventional structures were then quantitatively evaluated based on the effective gauge factor ($GF_{eff}$), which is defined as the ratio of the relative resistance change to the applied strain to the substrate: $GF_{eff} = (\\Delta R/R)/\\varepsilon_{sub}$. Accordingly, the effective gauge factor of the released micro SiC was found to be 28, while that of the non-released SiC nanowires was 35. From the data shown in Fig. \\ref{fig:DRR}, the effective gauge factor of the 380 nm and 470 nm SiC nanowires in the nano strain-amplifier were calculated as 150 and 124, respectively. Thus for nanowire arrays with average widths of 380 nm and 470 nm, the sensitivity of the nano strain-amplifier was 5.4 times and 4.6 times larger than the bulk SiC, respectively. These results were consistent with analytical and numerical models presented above. The relative resistance change of the nano strain-amplifier also showed excellent linearity with the applied strain, with a linear regression of above 99\\%. \r\n\r\nThe resistance change of the nano strain-amplifier can also be converted into voltage signals using a Wheatstone bridge, Fig. \\ref{fig:DRR}(b). The output voltage of the nano strain-amplifier increases with increasing tensile strains from 0 ppm to 180 ppm, and returned to the initial value when the strain was completely removed, confirming a good repeatability after several strain induced cycles. The linearity of the relative resistance change, and the repeatability indicate that the proposed structure is promising for strain sensing applications.\r\n \r\nIn conclusion, this work presents a novel mechanical approach to obtain highly sensitive piezoresistance in nanowires based on a nano strain-amplifier. The key factor of the nano strain-amplifier lies on nanowires locally fabricated on a released micro structure. Experimental studies were conducted on SiC nanowires, confirming that by utilizing our nano strain-amplifier, the sensitivity of SiC nanowires was 5.4 times larger than that of conventional structures. This result indicated that the nano strain-amplifier is an excellent platform for ultra sensitive strain sensing applications. \r\n\r\n\r\n",
"id": "1b77ae9f541b19668cc96624c7ec0f83945284e2",
"metadata": {
"file_path": "/home/ubuntu/dolma-v1_7/arxiv-0000.json.gz"
}
},
"truncated_cells": []
}
]
| + | USER_QUERY: code vulnerability dataset | HUB_DATASET_PREVIEW: DATASET_NAME: "benjis/bigvul"
FEATURES: {'CVE ID': {'dtype': 'string', '_type': 'Value'}, 'CVE Page': {'dtype': 'string', '_type': 'Value'}, 'CWE ID': {'dtype': 'string', '_type': 'Value'}, 'codeLink': {'dtype': 'string', '_type': 'Value'}, 'commit_id': {'dtype': 'string', '_type': 'Value'}, 'commit_message': {'dtype': 'string', '_type': 'Value'}, 'func_after': {'dtype': 'string', '_type': 'Value'}, 'func_before': {'dtype': 'string', '_type': 'Value'}, 'lang': {'dtype': 'string', '_type': 'Value'}, 'project': {'dtype': 'string', '_type': 'Value'}, 'vul': {'dtype': 'int8', '_type': 'Value'}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"CVE ID": "CVE-2017-7586",
"CVE Page": "https://www.cvedetails.com/cve/CVE-2017-7586/",
"CWE ID": "CWE-119",
"codeLink": "https://github.com/erikd/libsndfile/commit/708e996c87c5fae77b104ccfeb8f6db784c32074",
"commit_id": "708e996c87c5fae77b104ccfeb8f6db784c32074",
"commit_message": "src/ : Move to a variable length header buffer\n\nPreviously, the `psf->header` buffer was a fixed length specified by\n`SF_HEADER_LEN` which was set to `12292`. This was problematic for\ntwo reasons; this value was un-necessarily large for the majority\nof files and too small for some others.\n\nNow the size of the header buffer starts at 256 bytes and grows as\nnecessary up to a maximum of 100k.",
"func_after": "psf_get_date_str (char *str, int maxlen)\n{\ttime_t\t\tcurrent ;\n\tstruct tm\ttimedata, *tmptr ;\n\n\ttime (¤t) ;\n\n#if defined (HAVE_GMTIME_R)\n\t/* If the re-entrant version is available, use it. */\n\ttmptr = gmtime_r (¤t, &timedata) ;\n#elif defined (HAVE_GMTIME)\n\t/* Otherwise use the standard one and copy the data to local storage. */\n\ttmptr = gmtime (¤t) ;\n\tmemcpy (&timedata, tmptr, sizeof (timedata)) ;\n#else\n\ttmptr = NULL ;\n#endif\n\n\tif (tmptr)\n\t\tsnprintf (str, maxlen, \"%4d-%02d-%02d %02d:%02d:%02d UTC\",\n\t\t\t1900 + timedata.tm_year, timedata.tm_mon, timedata.tm_mday,\n\t\t\ttimedata.tm_hour, timedata.tm_min, timedata.tm_sec) ;\n\telse\n\t\tsnprintf (str, maxlen, \"Unknown date\") ;\n\n\treturn ;\n} /* psf_get_date_str */\n",
"func_before": "psf_get_date_str (char *str, int maxlen)\n{\ttime_t\t\tcurrent ;\n\tstruct tm\ttimedata, *tmptr ;\n\n\ttime (¤t) ;\n\n#if defined (HAVE_GMTIME_R)\n\t/* If the re-entrant version is available, use it. */\n\ttmptr = gmtime_r (¤t, &timedata) ;\n#elif defined (HAVE_GMTIME)\n\t/* Otherwise use the standard one and copy the data to local storage. */\n\ttmptr = gmtime (¤t) ;\n\tmemcpy (&timedata, tmptr, sizeof (timedata)) ;\n#else\n\ttmptr = NULL ;\n#endif\n\n\tif (tmptr)\n\t\tsnprintf (str, maxlen, \"%4d-%02d-%02d %02d:%02d:%02d UTC\",\n\t\t\t1900 + timedata.tm_year, timedata.tm_mon, timedata.tm_mday,\n\t\t\ttimedata.tm_hour, timedata.tm_min, timedata.tm_sec) ;\n\telse\n\t\tsnprintf (str, maxlen, \"Unknown date\") ;\n\n\treturn ;\n} /* psf_get_date_str */\n",
"lang": "C",
"project": "libsndfile",
"vul": 0
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"CVE ID": "CVE-2018-18352",
"CVE Page": "https://www.cvedetails.com/cve/CVE-2018-18352/",
"CWE ID": "CWE-732",
"codeLink": "https://github.com/chromium/chromium/commit/a9cbaa7a40e2b2723cfc2f266c42f4980038a949",
"commit_id": "a9cbaa7a40e2b2723cfc2f266c42f4980038a949",
"commit_message": "Simplify \"WouldTaintOrigin\" concept in media/blink\n\nCurrently WebMediaPlayer has three predicates:\n - DidGetOpaqueResponseFromServiceWorker\n - HasSingleSecurityOrigin\n - DidPassCORSAccessCheck\n. These are used to determine whether the response body is available\nfor scripts. They are known to be confusing, and actually\nMediaElementAudioSourceHandler::WouldTaintOrigin misuses them.\n\nThis CL merges the three predicates to one, WouldTaintOrigin, to remove\nthe confusion. Now the \"response type\" concept is available and we\ndon't need a custom CORS check, so this CL removes\nBaseAudioContext::WouldTaintOrigin. This CL also renames\nURLData::has_opaque_data_ and its (direct and indirect) data accessors\nto match the spec.\n\nBug: 849942, 875153\nChange-Id: I6acf50169d7445c4ff614e80ac606f79ee577d2a\nReviewed-on: https://chromium-review.googlesource.com/c/1238098\nReviewed-by: Fredrik Hubinette \nReviewed-by: Kinuko Yasuda \nReviewed-by: Raymond Toy \nCommit-Queue: Yutaka Hirano \nCr-Commit-Position: refs/heads/master@{#598258}",
"func_after": "void MultibufferDataSource::CreateResourceLoader(int64_t first_byte_position,\n int64_t last_byte_position) {\n DCHECK(render_task_runner_->BelongsToCurrentThread());\n\n SetReader(new MultiBufferReader(\n url_data()->multibuffer(), first_byte_position, last_byte_position,\n base::Bind(&MultibufferDataSource::ProgressCallback, weak_ptr_)));\n reader_->SetIsClientAudioElement(is_client_audio_element_);\n UpdateBufferSizes();\n}\n",
"func_before": "void MultibufferDataSource::CreateResourceLoader(int64_t first_byte_position,\n int64_t last_byte_position) {\n DCHECK(render_task_runner_->BelongsToCurrentThread());\n\n SetReader(new MultiBufferReader(\n url_data()->multibuffer(), first_byte_position, last_byte_position,\n base::Bind(&MultibufferDataSource::ProgressCallback, weak_ptr_)));\n reader_->SetIsClientAudioElement(is_client_audio_element_);\n UpdateBufferSizes();\n}\n",
"lang": "C",
"project": "Chrome",
"vul": 0
},
"truncated_cells": []
}
]
| NEGATIVE: DATASET_NAME: "sfakhoury/NL2Fix"
FEATURES: {'defects4j_project': {'dtype': 'string', '_type': 'Value'}, 'defects4j_bug_id': {'dtype': 'string', '_type': 'Value'}, 'file_path': {'dtype': 'string', '_type': 'Value'}, 'bug_start_line': {'dtype': 'string', '_type': 'Value'}, 'bug_end_line': {'dtype': 'string', '_type': 'Value'}, 'issue_title': {'dtype': 'string', '_type': 'Value'}, 'issue_description': {'dtype': 'string', '_type': 'Value'}, 'original_src': {'dtype': 'string', '_type': 'Value'}, 'original_src_wo_comments': {'dtype': 'string', '_type': 'Value'}, 'fixed_src': {'dtype': 'string', '_type': 'Value'}, 'fixed_src_wo_comments': {'dtype': 'string', '_type': 'Value'}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"defects4j_project": "Math",
"defects4j_bug_id": "19",
"file_path": "src/main/java/org/apache/commons/math3/optimization/direct/CMAESOptimizer.java",
"bug_start_line": "504",
"bug_end_line": "561",
"issue_title": "Wide bounds to CMAESOptimizer result in NaN parameters passed to fitness function",
"issue_description": "If you give large values as lower/upper bounds (for example -Double.MAX_VALUE as a lower bound), the optimizer can call the fitness function with parameters set to NaN. My guess is this is due to FitnessFunction.encode/decode generating NaN when normalizing/denormalizing parameters. For example, if the difference between the lower and upper bound is greater than Double.MAX_VALUE, encode could divide infinity by infinity.",
"original_src": "private void checkParameters() {\n final double[] init = getStartPoint();\n final double[] lB = getLowerBound();\n final double[] uB = getUpperBound();\n\n // Checks whether there is at least one finite bound value.\n boolean hasFiniteBounds = false;\n for (int i = 0; i < lB.length; i++) {\n if (!Double.isInfinite(lB[i]) ||\n !Double.isInfinite(uB[i])) {\n hasFiniteBounds = true;\n break;\n }\n }\n // Checks whether there is at least one infinite bound value.\n boolean hasInfiniteBounds = false;\n if (hasFiniteBounds) {\n for (int i = 0; i < lB.length; i++) {\n if (Double.isInfinite(lB[i]) ||\n Double.isInfinite(uB[i])) {\n hasInfiniteBounds = true;\n break;\n }\n }\n\n if (hasInfiniteBounds) {\n // If there is at least one finite bound, none can be infinite,\n // because mixed cases are not supported by the current code.\n throw new MathUnsupportedOperationException();\n } else {\n // Convert API to internal handling of boundaries.\n boundaries = new double[2][];\n boundaries[0] = lB;\n boundaries[1] = uB;\n\n // Abort early if the normalization will overflow (cf. \"encode\" method).\n }\n } else {\n // Convert API to internal handling of boundaries.\n boundaries = null;\n }\n\n if (inputSigma != null) {\n if (inputSigma.length != init.length) {\n throw new DimensionMismatchException(inputSigma.length, init.length);\n }\n for (int i = 0; i < init.length; i++) {\n if (inputSigma[i] < 0) {\n throw new NotPositiveException(inputSigma[i]);\n }\n if (boundaries != null) {\n if (inputSigma[i] > boundaries[1][i] - boundaries[0][i]) {\n throw new OutOfRangeException(inputSigma[i], 0, boundaries[1][i] - boundaries[0][i]);\n }\n }\n }\n }\n }",
"original_src_wo_comments": "private void checkParameters ( ) { final double [ ] init = getStartPoint ( ) ; final double [ ] lB = getLowerBound ( ) ; final double [ ] uB = getUpperBound ( ) ; boolean hasFiniteBounds = false ; for ( int i = 0 ; i < lB . length ; i ++ ) { if ( ! Double . isInfinite ( lB [ i ] ) || ! Double . isInfinite ( uB [ i ] ) ) { hasFiniteBounds = true ; break ; } } boolean hasInfiniteBounds = false ; if ( hasFiniteBounds ) { for ( int i = 0 ; i < lB . length ; i ++ ) { if ( Double . isInfinite ( lB [ i ] ) || Double . isInfinite ( uB [ i ] ) ) { hasInfiniteBounds = true ; break ; } } if ( hasInfiniteBounds ) { throw new MathUnsupportedOperationException ( ) ; } else { boundaries = new double [ 2 ] [ ] ; boundaries [ 0 ] = lB ; boundaries [ 1 ] = uB ; } } else { boundaries = null ; } if ( inputSigma != null ) { if ( inputSigma . length != init . length ) { throw new DimensionMismatchException ( inputSigma . length , init . length ) ; } for ( int i = 0 ; i < init . length ; i ++ ) { if ( inputSigma [ i ] < 0 ) { throw new NotPositiveException ( inputSigma [ i ] ) ; } if ( boundaries != null ) { if ( inputSigma [ i ] > boundaries [ 1 ] [ i ] - boundaries [ 0 ] [ i ] ) { throw new OutOfRangeException ( inputSigma [ i ] , 0 , boundaries [ 1 ] [ i ] - boundaries [ 0 ] [ i ] ) ; } } } } }",
"fixed_src": "private void checkParameters() {\n final double[] init = getStartPoint();\n final double[] lB = getLowerBound();\n final double[] uB = getUpperBound();\n\n // Checks whether there is at least one finite bound value.\n boolean hasFiniteBounds = false;\n for (int i = 0; i < lB.length; i++) {\n if (!Double.isInfinite(lB[i]) ||\n !Double.isInfinite(uB[i])) {\n hasFiniteBounds = true;\n break;\n }\n }\n // Checks whether there is at least one infinite bound value.\n boolean hasInfiniteBounds = false;\n if (hasFiniteBounds) {\n for (int i = 0; i < lB.length; i++) {\n if (Double.isInfinite(lB[i]) ||\n Double.isInfinite(uB[i])) {\n hasInfiniteBounds = true;\n break;\n }\n }\n\n if (hasInfiniteBounds) {\n // If there is at least one finite bound, none can be infinite,\n // because mixed cases are not supported by the current code.\n throw new MathUnsupportedOperationException();\n } else {\n // Convert API to internal handling of boundaries.\n boundaries = new double[2][];\n boundaries[0] = lB;\n boundaries[1] = uB;\n\n // Abort early if the normalization will overflow (cf. \"encode\" method).\n for (int i = 0; i < lB.length; i++) {\n if (Double.isInfinite(boundaries[1][i] - boundaries[0][i])) {\n final double max = Double.MAX_VALUE + boundaries[0][i];\n final NumberIsTooLargeException e\n = new NumberIsTooLargeException(boundaries[1][i],\n max,\n true);\n e.getContext().addMessage(LocalizedFormats.OVERFLOW);\n e.getContext().addMessage(LocalizedFormats.INDEX, i);\n\n throw e;\n }\n }\n }\n } else {\n // Convert API to internal handling of boundaries.\n boundaries = null;\n }\n\n if (inputSigma != null) {\n if (inputSigma.length != init.length) {\n throw new DimensionMismatchException(inputSigma.length, init.length);\n }\n for (int i = 0; i < init.length; i++) {\n if (inputSigma[i] < 0) {\n throw new NotPositiveException(inputSigma[i]);\n }\n if (boundaries != null) {\n if (inputSigma[i] > boundaries[1][i] - boundaries[0][i]) {\n throw new OutOfRangeException(inputSigma[i], 0, boundaries[1][i] - boundaries[0][i]);\n }\n }\n }\n }\n }",
"fixed_src_wo_comments": "private void checkParameters ( ) { final double [ ] init = getStartPoint ( ) ; final double [ ] lB = getLowerBound ( ) ; final double [ ] uB = getUpperBound ( ) ; boolean hasFiniteBounds = false ; for ( int i = 0 ; i < lB . length ; i ++ ) { if ( ! Double . isInfinite ( lB [ i ] ) || ! Double . isInfinite ( uB [ i ] ) ) { hasFiniteBounds = true ; break ; } } boolean hasInfiniteBounds = false ; if ( hasFiniteBounds ) { for ( int i = 0 ; i < lB . length ; i ++ ) { if ( Double . isInfinite ( lB [ i ] ) || Double . isInfinite ( uB [ i ] ) ) { hasInfiniteBounds = true ; break ; } } if ( hasInfiniteBounds ) { throw new MathUnsupportedOperationException ( ) ; } else { boundaries = new double [ 2 ] [ ] ; boundaries [ 0 ] = lB ; boundaries [ 1 ] = uB ; for ( int i = 0 ; i < lB . length ; i ++ ) { if ( Double . isInfinite ( boundaries [ 1 ] [ i ] - boundaries [ 0 ] [ i ] ) ) { final double max = Double . MAX_VALUE + boundaries [ 0 ] [ i ] ; final NumberIsTooLargeException e = new NumberIsTooLargeException ( boundaries [ 1 ] [ i ] , max , true ) ; e . getContext ( ) . addMessage ( LocalizedFormats . OVERFLOW ) ; e . getContext ( ) . addMessage ( LocalizedFormats . INDEX , i ) ; throw e ; } } } } else { boundaries = null ; } if ( inputSigma != null ) { if ( inputSigma . length != init . length ) { throw new DimensionMismatchException ( inputSigma . length , init . length ) ; } for ( int i = 0 ; i < init . length ; i ++ ) { if ( inputSigma [ i ] < 0 ) { throw new NotPositiveException ( inputSigma [ i ] ) ; } if ( boundaries != null ) { if ( inputSigma [ i ] > boundaries [ 1 ] [ i ] - boundaries [ 0 ] [ i ] ) { throw new OutOfRangeException ( inputSigma [ i ] , 0 , boundaries [ 1 ] [ i ] - boundaries [ 0 ] [ i ] ) ; } } } } }"
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"defects4j_project": "Compress",
"defects4j_bug_id": "16",
"file_path": "src/main/java/org/apache/commons/compress/archivers/ArchiveStreamFactory.java",
"bug_start_line": "197",
"bug_end_line": "258",
"issue_title": "Too relaxed tar detection in ArchiveStreamFactory",
"issue_description": "The relaxed tar detection logic added in COMPRESS-117 unfortunately matches also some non-tar files like a [test AIFF file|https://svn.apache.org/repos/asf/tika/trunk/tika-parsers/src/test/resources/test-documents/testAIFF.aif] that Apache Tika uses. It would be good to improve the detection heuristics to still match files like the one in COMPRESS-117 but avoid false positives like the AIFF file in Tika.",
"original_src": "public ArchiveInputStream createArchiveInputStream(final InputStream in)\n throws ArchiveException {\n if (in == null) {\n throw new IllegalArgumentException(\"Stream must not be null.\");\n }\n\n if (!in.markSupported()) {\n throw new IllegalArgumentException(\"Mark is not supported.\");\n }\n\n final byte[] signature = new byte[12];\n in.mark(signature.length);\n try {\n int signatureLength = in.read(signature);\n in.reset();\n if (ZipArchiveInputStream.matches(signature, signatureLength)) {\n return new ZipArchiveInputStream(in);\n } else if (JarArchiveInputStream.matches(signature, signatureLength)) {\n return new JarArchiveInputStream(in);\n } else if (ArArchiveInputStream.matches(signature, signatureLength)) {\n return new ArArchiveInputStream(in);\n } else if (CpioArchiveInputStream.matches(signature, signatureLength)) {\n return new CpioArchiveInputStream(in);\n }\n\n // Dump needs a bigger buffer to check the signature;\n final byte[] dumpsig = new byte[32];\n in.mark(dumpsig.length);\n signatureLength = in.read(dumpsig);\n in.reset();\n if (DumpArchiveInputStream.matches(dumpsig, signatureLength)) {\n return new DumpArchiveInputStream(in);\n }\n\n // Tar needs an even bigger buffer to check the signature; read the first block\n final byte[] tarheader = new byte[512];\n in.mark(tarheader.length);\n signatureLength = in.read(tarheader);\n in.reset();\n if (TarArchiveInputStream.matches(tarheader, signatureLength)) {\n return new TarArchiveInputStream(in);\n }\n // COMPRESS-117 - improve auto-recognition\n if (signatureLength >= 512) {\n try {\n TarArchiveInputStream tais = new TarArchiveInputStream(new ByteArrayInputStream(tarheader));\n // COMPRESS-191 - verify the header checksum\n tais.getNextEntry();\n return new TarArchiveInputStream(in);\n } catch (Exception e) { // NOPMD\n // can generate IllegalArgumentException as well\n // as IOException\n // autodetection, simply not a TAR\n // ignored\n }\n }\n } catch (IOException e) {\n throw new ArchiveException(\"Could not use reset and mark operations.\", e);\n }\n\n throw new ArchiveException(\"No Archiver found for the stream signature\");\n }",
"original_src_wo_comments": "public ArchiveInputStream createArchiveInputStream ( final InputStream in ) throws ArchiveException { if ( in == null ) { throw new IllegalArgumentException ( \"Stream must not be null.\" ) ; } if ( ! in . markSupported ( ) ) { throw new IllegalArgumentException ( \"Mark is not supported.\" ) ; } final byte [ ] signature = new byte [ 12 ] ; in . mark ( signature . length ) ; try { int signatureLength = in . read ( signature ) ; in . reset ( ) ; if ( ZipArchiveInputStream . matches ( signature , signatureLength ) ) { return new ZipArchiveInputStream ( in ) ; } else if ( JarArchiveInputStream . matches ( signature , signatureLength ) ) { return new JarArchiveInputStream ( in ) ; } else if ( ArArchiveInputStream . matches ( signature , signatureLength ) ) { return new ArArchiveInputStream ( in ) ; } else if ( CpioArchiveInputStream . matches ( signature , signatureLength ) ) { return new CpioArchiveInputStream ( in ) ; } final byte [ ] dumpsig = new byte [ 32 ] ; in . mark ( dumpsig . length ) ; signatureLength = in . read ( dumpsig ) ; in . reset ( ) ; if ( DumpArchiveInputStream . matches ( dumpsig , signatureLength ) ) { return new DumpArchiveInputStream ( in ) ; } final byte [ ] tarheader = new byte [ 512 ] ; in . mark ( tarheader . length ) ; signatureLength = in . read ( tarheader ) ; in . reset ( ) ; if ( TarArchiveInputStream . matches ( tarheader , signatureLength ) ) { return new TarArchiveInputStream ( in ) ; } if ( signatureLength >= 512 ) { try { TarArchiveInputStream tais = new TarArchiveInputStream ( new ByteArrayInputStream ( tarheader ) ) ; tais . getNextEntry ( ) ; return new TarArchiveInputStream ( in ) ; } catch ( Exception e ) { } } } catch ( IOException e ) { throw new ArchiveException ( \"Could not use reset and mark operations.\" , e ) ; } throw new ArchiveException ( \"No Archiver found for the stream signature\" ) ; }",
"fixed_src": "public ArchiveInputStream createArchiveInputStream(final InputStream in)\n throws ArchiveException {\n if (in == null) {\n throw new IllegalArgumentException(\"Stream must not be null.\");\n }\n\n if (!in.markSupported()) {\n throw new IllegalArgumentException(\"Mark is not supported.\");\n }\n\n final byte[] signature = new byte[12];\n in.mark(signature.length);\n try {\n int signatureLength = in.read(signature);\n in.reset();\n if (ZipArchiveInputStream.matches(signature, signatureLength)) {\n return new ZipArchiveInputStream(in);\n } else if (JarArchiveInputStream.matches(signature, signatureLength)) {\n return new JarArchiveInputStream(in);\n } else if (ArArchiveInputStream.matches(signature, signatureLength)) {\n return new ArArchiveInputStream(in);\n } else if (CpioArchiveInputStream.matches(signature, signatureLength)) {\n return new CpioArchiveInputStream(in);\n }\n\n // Dump needs a bigger buffer to check the signature;\n final byte[] dumpsig = new byte[32];\n in.mark(dumpsig.length);\n signatureLength = in.read(dumpsig);\n in.reset();\n if (DumpArchiveInputStream.matches(dumpsig, signatureLength)) {\n return new DumpArchiveInputStream(in);\n }\n\n // Tar needs an even bigger buffer to check the signature; read the first block\n final byte[] tarheader = new byte[512];\n in.mark(tarheader.length);\n signatureLength = in.read(tarheader);\n in.reset();\n if (TarArchiveInputStream.matches(tarheader, signatureLength)) {\n return new TarArchiveInputStream(in);\n }\n // COMPRESS-117 - improve auto-recognition\n if (signatureLength >= 512) {\n try {\n TarArchiveInputStream tais = new TarArchiveInputStream(new ByteArrayInputStream(tarheader));\n // COMPRESS-191 - verify the header checksum\n if (tais.getNextTarEntry().isCheckSumOK()) {\n return new TarArchiveInputStream(in);\n }\n } catch (Exception e) { // NOPMD\n // can generate IllegalArgumentException as well\n // as IOException\n // autodetection, simply not a TAR\n // ignored\n }\n }\n } catch (IOException e) {\n throw new ArchiveException(\"Could not use reset and mark operations.\", e);\n }\n\n throw new ArchiveException(\"No Archiver found for the stream signature\");\n }",
"fixed_src_wo_comments": "public ArchiveInputStream createArchiveInputStream ( final InputStream in ) throws ArchiveException { if ( in == null ) { throw new IllegalArgumentException ( \"Stream must not be null.\" ) ; } if ( ! in . markSupported ( ) ) { throw new IllegalArgumentException ( \"Mark is not supported.\" ) ; } final byte [ ] signature = new byte [ 12 ] ; in . mark ( signature . length ) ; try { int signatureLength = in . read ( signature ) ; in . reset ( ) ; if ( ZipArchiveInputStream . matches ( signature , signatureLength ) ) { return new ZipArchiveInputStream ( in ) ; } else if ( JarArchiveInputStream . matches ( signature , signatureLength ) ) { return new JarArchiveInputStream ( in ) ; } else if ( ArArchiveInputStream . matches ( signature , signatureLength ) ) { return new ArArchiveInputStream ( in ) ; } else if ( CpioArchiveInputStream . matches ( signature , signatureLength ) ) { return new CpioArchiveInputStream ( in ) ; } final byte [ ] dumpsig = new byte [ 32 ] ; in . mark ( dumpsig . length ) ; signatureLength = in . read ( dumpsig ) ; in . reset ( ) ; if ( DumpArchiveInputStream . matches ( dumpsig , signatureLength ) ) { return new DumpArchiveInputStream ( in ) ; } final byte [ ] tarheader = new byte [ 512 ] ; in . mark ( tarheader . length ) ; signatureLength = in . read ( tarheader ) ; in . reset ( ) ; if ( TarArchiveInputStream . matches ( tarheader , signatureLength ) ) { return new TarArchiveInputStream ( in ) ; } if ( signatureLength >= 512 ) { try { TarArchiveInputStream tais = new TarArchiveInputStream ( new ByteArrayInputStream ( tarheader ) ) ; if ( tais . getNextTarEntry ( ) . isCheckSumOK ( ) ) { return new TarArchiveInputStream ( in ) ; } } catch ( Exception e ) { } } } catch ( IOException e ) { throw new ArchiveException ( \"Could not use reset and mark operations.\" , e ) ; } throw new ArchiveException ( \"No Archiver found for the stream signature\" ) ; }"
},
"truncated_cells": []
}
]
| + | USER_QUERY: english korean translation dataset | HUB_DATASET_PREVIEW: DATASET_NAME: "yoonjae22/Aihub_translate"
FEATURES: {'instruction': {'dtype': 'string', '_type': 'Value'}, 'output': {'dtype': 'string', '_type': 'Value'}, 'text': {'dtype': 'string', '_type': 'Value'}, 'input': {'dtype': 'string', '_type': 'Value'}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"input": "Bible Coloring' is a coloring application that allows you to experience beautiful stories in the Bible.",
"output": "'Bible Coloring'\uc740 \uc131\uacbd\uc758 \uc544\ub984\ub2e4\uc6b4 \uc774\uc57c\uae30\ub97c \uccb4\ud5d8 \ud560 \uc218 \uc788\ub294 \uceec\ub7ec\ub9c1 \uc571\uc785\ub2c8\ub2e4.",
"instruction": "Please translate the English sentence into Korean.",
"text": "Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\nBible Coloring' is a coloring application that allows you to experience beautiful stories in the Bible.\n\n###Response:\n'Bible Coloring'\uc740 \uc131\uacbd\uc758 \uc544\ub984\ub2e4\uc6b4 \uc774\uc57c\uae30\ub97c \uccb4\ud5d8 \ud560 \uc218 \uc788\ub294 \uceec\ub7ec\ub9c1 \uc571\uc785\ub2c8\ub2e4."
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"input": "Do you work at a City bank?",
"output": "\uc528\ud2f0\uc740\ud589\uc5d0\uc11c \uc77c\ud558\uc138\uc694?",
"instruction": "Please translate the English sentence into Korean.",
"text": "Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\nDo you work at a City bank?\n\n###Response:\n\uc528\ud2f0\uc740\ud589\uc5d0\uc11c \uc77c\ud558\uc138\uc694?"
},
"truncated_cells": []
}
]
| NEGATIVE: DATASET_NAME: "werty1248/EnKo-Translation-LongTextOnly-dedup"
FEATURES: {'english': {'dtype': 'string', '_type': 'Value'}, 'korean': {'dtype': 'string', '_type': 'Value'}, 'from': {'dtype': 'string', '_type': 'Value'}, 'category': {'dtype': 'string', '_type': 'Value'}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"english": "ROOFTOP GREENING STRUCTURETo provide a structure firmly and easily installing a house cultivation arch-like aggregate in a rooftop greening structure. This rooftop greening structure includes pressingly fixing each of support stands 24 in each of support stand line groups 24A to a rooftop slab surface through a greening support layer 6, using a fastener which pierces into the greening support layer 6, and steps over between each of the support stands 24 and the rooftop slab surface 2, and installing a holding member 36 for holding a house cultivation arch-like aggregate 50 each on the upper end surface of each of the support stands 24 in each of the support stand line groups 24A. As a result of this, the support stand 24 which has stiffness higher than the greening support layer 6, and is firmly fixed to the rooftop slab surface 2 through the greening support layer 6 is used for holding the end part of the arch-like aggregate 50. The holding member 36 for holding the end part of the arch-like aggregate 50 is installed on the upper end surface of the support stand 24 so as to suppress the holding member 36 from burying in soil and increase the visibility.In a rooftop greening structure in which a greening support layer is formed by laying a plurality of greening support panels on the rooftop floor and soil is arranged on the greening support layer, a pair of support stands are placed on the greening support layer. The rows are arranged so as to be separated from each other, and each of the support rows is arranged upright so as to form a row with a plurality of supports having higher rigidity than the greening support layer. Each support pedestal in each support pedestal row group is configured through the greening support layer by using a fastener that penetrates the greening support layer and straddles between each support pedestal and the rooftop floor surface. It is characterized in that it is pressed and fixed to the rooftop floor surface, and the upper end surface of each support stand in each support stand row group is provided with a holding portion for holding an arch-shaped aggregate for house cultivation. Rooftop greening structure.",
"korean": "\uc625\uc0c1 \ub179\ud654 \uad6c\uc870\uc625\uc0c1 \ub179\ud654 \uad6c\uc870\uc5d0 \uc788\uc5b4\uc11c \ud558\uc6b0\uc2a4 \uc7ac\ubc30\uc6a9 \uc544\uce58\ud615 \uace8\uc7ac\ub97c \uacac\uace0\ud558\uace0 \uc6a9\uc774\ud558\uac8c \uace0\uc815\ud558\ub294 \uad6c\uc870\ub97c \uc81c\uacf5\ud55c\ub2e4. \uac01 \uc9c0\uc9c0\ub300\ub82c\uad70 24 A\uc758 \uac01 \uc9c0\uc9c0\ub300 24\ub97c \ub179\ud654 \uc9c0\uc6d0\uce35 6\uc744 \uad00\ud1b5\ud574 \uac01 \uc9c0\uc9c0\ub300 24\uc640 \uc625\uc0c1 \uc2ac\ub798\ube0c\uba74 2 \uc0ac\uc774\ub97c \ub118\ub294 \uace0\uc815\uad6c\ub97c \uc774\uc6a9\ud568\uc73c\ub85c\uc368, \ub179\ud654 \uc9c0\uc6d0\uce35 6\uc744 \ud1b5\ud574 \uc0c1\uae30 \uc625\uc0c1 \uc2ac\ub798\ube0c\uba74\uc5d0 \uac00\uc555 \uace0\uc815\ud558\uace0 \uadf8 \uac01 \uc9c0\uc9c0\ub300\ub82c\uad70 24 A\uc758 \uac01 \uc9c0\uc9c0\ub300 24\uc758 \uc0c1\ub2e8\uba74\uc5d0 \ud558\uc6b0\uc2a4 \uc7ac\ubc30\uc6a9 \uc544\uce58\ud615 \uace8\uc7ac 50\uc744 \uc9c0\uc9c0\ud558\uae30 \uc704\ud55c \uc9c0\uc9c0 \ubd80\uc7ac 36\uc744 \uac01\uac01 \ub9c8\ub828\ud55c\ub2e4. \uc774\uac83\uc5d0 \uc758\ud574 \uc544\uce58\ud615 \uace8\uc7ac 50\uc758 \ub2e8\ubd80\ub97c \uc9c0\uc9c0\ud558\ub294 \uac83\uc73c\ub85c\uc11c \ub179\ud654 \uc9c0\uc6d0\uce35 6\ubcf4\ub2e4 \uac15\uc131\uc774 \ub192\uace0 \uc625\uc0c1 \uc2ac\ub798\ube0c\uba74 2\uc5d0 \ub179\ud654 \uc9c0\uc6d0\uce35 6\uc744 \ud1b5\ud574 \uc81c\ub300\ub85c \uace0\uc815\ub41c \uc9c0\uc9c0\ub300 24\uac00 \uc774\uc6a9\ub418\ub3c4\ub85d \ud55c\ub2e4. \ub610\ud55c \uc544\uce58\ud615 \uace8\uc7ac 50\uc758 \ub2e8\ubd80\ub97c \uc9c0\uc9c0\ud558\ub294 \uc9c0\uc9c0 \ubd80\uc7ac 36\uc744 \uc9c0\uc9c0\ub300 24\uc758 \uc0c1\ub2e8\uba74\uc5d0 \ub9c8\ub828\ud568\uc73c\ub85c\uc368, \ud1a0\uc591\uc5d0 \ud30c\ubb3b\ud788\ub294 \uac83\uc744 \uc5b5\uc81c\ud558\uace0 \uadf8 \uc9c0\uc9c0 \ubd80\uc7ac 36\uc758 \uc2dc\uc778\uc131\uc744 \ud5a5\uc0c1\uc2dc\ud0a8\ub2e4.\uc625\uc0c1 \ubc14\ub2e5\uba74\uc0c1\uc5d0 \ubcf5\uc218\uc758 \ub179\ud654 \uc9c0\uc6d0 \ud328\ub110\uc744 \ubd80\uc124\ud568\uc73c\ub85c\uc368 \ub179\ud654 \uc9c0\uc6d0\uce35\uc774 \ud615\uc131\ub418\uace0 \uc0c1\uae30 \ub179\ud654 \uc9c0\uc6d0\uce35\uc0c1\uc5d0 \ud1a0\uc591\uc774 \ubc30\uc124\ub418\ub294 \uc625\uc0c1 \ub179\ud654 \uad6c\uc870\uc5d0 \uc788\uc5b4\uc11c \uc0c1\uae30 \ub179\ud654 \uc9c0\uc6d0\uce35\uc0c1\uc5d0 \ud55c \uc30d\uc758 \uc9c0\uc9c0\ub300\ub82c\uad70\uc774 \uc11c\ub85c \uc774\uaca9\ub41c \uc0c1\ud0dc\ub97c \uac00\uc9c0\uace0 \ubc30\uce58\ub418\uace0 \uc0c1\uae30 \uac01 \uc9c0\uc9c0\ub300\ub82c\uad70\uc774 \uc0c1\uae30 \ub179\ud654 \uc9c0\uc6d0\uce35\ubcf4\ub2e4 \uac15\uc131\uc774 \ud5a5\uc0c1\ub41c \ubcf5\uc218\uc758 \uc9c0\uc9c0\ub300\ub97c \uac04\uaca9\uc744 \ub450\uba74\uc11c \uc5f4\uc744 \uc774\ub8e8\ub3c4\ub85d \uc785\uc124 \ubc30\uce58\ud568\uc73c\ub85c\uc368 \uad6c\uc131\ub418\uace0 \uc0c1\uae30 \uac01 \uc9c0\uc9c0\ub300\ub82c\uad70\uc758 \uac01 \uc9c0\uc9c0\ub300\uac00 \uc0c1\uae30 \ub179\ud654 \uc9c0\uc6d0\uce35\uc744 \uad00\ud1b5\ud574 \uc0c1\uae30 \uac01 \uc9c0\uc9c0\ub300\uc640 \uc0c1\uae30 \uc625\uc0c1 \ubc14\ub2e5\uba74 \uc0ac\uc774\ub97c \ub118\ub294 \uace0\uc815\uad6c\ub97c \uc774\uc6a9\ud568\uc73c\ub85c\uc368, \uc0c1\uae30 \ub179\ud654 \uc9c0\uc6d0\uce35\uc744 \ud1b5\ud574 \uc0c1\uae30 \uc625\uc0c1 \ubc14\ub2e5\uba74\uc5d0 \uac00\uc555 \uace0\uc815\ub418\uc5b4 \uc0c1\uae30 \uac01 \uc9c0\uc9c0\ub300\ub82c\uad70\uc758 \uac01 \uc9c0\uc9c0\ub300\uc758 \uc0c1\ub2e8\uba74\uc5d0\ub294 \ud558\uc6b0\uc2a4 \uc7ac\ubc30\uc6a9 \uc544\uce58\ud615 \uace8\uc7ac\ub97c \uc9c0\uc9c0\ud558\uae30 \uc704\ud55c \uc9c0\uc9c0\ubd80\uac00 \uac01\uac01 \uad6c\ube44\ub418\uc5b4 \uc788\ub294, \uac83\uc744 \ud2b9\uc9d5\uc73c\ub85c \ud558\ub294 \uc625\uc0c1 \ub179\ud654 \uad6c\uc870.",
"from": "nayohan/aihub-en-ko-translation-12m",
"category": "full"
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"english": "Native chicken breeding methodThe invention discloses a native chicken breeding method, which includes steps that the shield degree of a breeding grove is 60-65%; a native chicken activity field with area of 5-8 mu is encircled bya 1.8-2.2m high nylon mesh; a ventilating and warming device is arranged in a henhouse; feed and water are delivered at 8: 00-15: 00 in every morning, and native chicken are put in grove activity field at 15:00-17: 00 in the afternoon; music is displayed at 17: 00-18: 30, and feed is delivered at outside of the henhouse to domesticize the native chickens , and then chickens are returned to the henhouse; the henhouse is cleaned at intervals of 12-15 days; the henhouse is sterilized by an automatic sterilizing system during the stocking period in the afternoon at intervals of 3-5 days. The native chicken breeding method can well consider about the stocking time, thus the stocking instinct of the native chickens is well guaranteed, the food intake of the native chickens is increased throughthe reasonable captive time; the meat growth is accelerated, the breeding cycle is shortened, and the meat quality of the native chickens is ensured.A kind of 1. cultural method of chicken, it is characterised in that\uff1ait the described method comprises the following steps\uff1a\uff081\uff09selection cultivation ground\uff1aselection away from livestock and poultry transaction place, slaughtering field, chemical plant, garbage disposal plant, avoid air, dust, water source, germ and the cultivation of the woods of noise pollution, the moon degree of covering of the woods is 60~65%, with 1.8~2.2 meters of high nylon net circle area is 5~8 mu of chicken playground, and vegetable seeds is broadcasted sowing in forest land\uff1b\uff082\uff09build chicken house\uff1athe wind sheltering in woods ground on the sunny side, hen house is built in the chicken playground centre position that physical features is high and dry, draining blowdown condition is good, and ventilation heating is set in hen house equipment, hen house is interior to set automatic sterilizing system\uff1b\uff083\uff09select kind\uff1aselect it is resistance to it is extensive, action flexibly, the pure native that power of looking for food is strong, premunition is strong\uff1b\uff084\uff09dietary management\uff1aevery mu of forest land puts 260~280 in a suitable place to breed, every morning 8:00~15:feed and water are launched in stable breeding when 00, afternoon 15:00~17:it is put into forest land playground when 00 to put in a suitable place to breed, 17:00~18:dispensing feed outside music colony house is played when 30 to enter row domestication makes chicken return to colony house, and day temperature is maintained at 20~23 degrees celsius in circle, and nocturnal temperature is maintained at 20~23 degrees celsius\uff1b \uff085\uff09disinfectant management\uff1ato being cleaned in hen house, colony house is started certainly during chicken is put in a suitable place to breed afternoon within every 3~5 days within every 12~15 days dynamic disinfection system is sterilized, and lime powder for every 2~3 months to the main passageway in woods forest land.",
"korean": "\ud1a0\uc885\ub2ed \uc0ac\uc721\ubc29\ubc95\uc774 \ubc1c\uba85\ud488\uc740 \uc0ac\uc721\uc7a5\uc758 \ubc29\ud328\ub3c4\uac00 60~65%\uc778 \ud1a0\uc885\ub2ed \uc0ac\uc721\ubc95\uc744 \uacf5\uac1c\ud558\uace0 \uc788\uc73c\uba70, \uba74\uc801\uc774 5~8m\uc778 \ud1a0\uc885\ub2ed \ud65c\ub3d9\uc7a5\uc744 1.8~2.2m \ub192\uc774\uc758 \ub098\uc77c\ub860 \uba54\uc2dc\ub85c \ub458\ub7ec\uc2f8\uace0 \uc788\uc73c\uba70, \ub2ed\uc7a5\uc5d0 \ud658\uae30 \ubc0f \ub09c\ubc29 \uc7a5\uce58\uac00 \ubc30\uce58\ub418\uc5b4 \uc788\uc73c\uba70, \ub9e4\uc77c \uc544\uce68 8\uc2dc~15\ubd84\uc5d0 \uc0ac\ub8cc\uc640 \ubb3c\uc774 \uc804\ub2ec\ub418\uace0 \uc788\ub2e4. \uadf8\ub9ac\uace0 \ud1a0\uc885\ub2ed\uc740 \uc624\ud6c4 15:00-17:00\uc5d0 \uc232 \ud65c\ub3d9\uc7a5\uc5d0 \ud22c\uc785\ub418\uace0, 17: 00-18:30\uc5d0\ub294 \uc74c\uc545\uc774 \uc5f0\uc8fc\ub418\uba70, \ubaa8\uc774\ub294 \ub2ed\uc7a5 \ubc16\uc5d0\uc11c \ubc30\ub2ec\uc744 \ubc1b\uc544 \ud1a0\uc885\ub2ed\uc744 \uae38\ub4e4\uc774\uace0, \ub2ed\uc7a5\uc740 12-15\uc77c \uac04\uaca9\uc73c\ub85c \ub2ed\uc7a5\uc73c\ub85c \ub3cc\ub824\ubcf4\ub0b8\ub2e4; \ub2ed\uc7a5\uc740 \uc790\ub3d9\uc18c\ub3c5\ub41c\ub2e4.c \uc624\ud6c4\uc758 \ubcf4\uad00 \uae30\uac04 \ub3d9\uc548 3~5\uc77c \uac04\uaca9\uc73c\ub85c \uba78\uade0 \uc2dc\uc2a4\ud15c. \ud1a0\uc885\ub2ed \uc0ac\uc721\ubc95\uc740 \uc0ac\uc721 \uc2dc\uac04\uc744 \uc798 \uace0\ub824\ud560 \uc218 \uc788\uae30 \ub54c\ubb38\uc5d0 \ud1a0\uc885\ub2ed\uc758 \uc0ac\uc721 \ubcf8\ub2a5\uc774 \uc798 \ubcf4\uc7a5\ub418\uace0, \ud1a0\uc885\ub2ed\uc758 \uba39\uc774 \uc12d\ucde8\uac00 \uc801\uc808\ud55c \ud3ec\ud68d \uc2dc\uac04\uc744 \ud1b5\ud574 \uc99d\uac00\ud55c\ub2e4; \uc721\uc2dd \uc131\uc7a5\uc774 \uac00\uc18d\ud654\ub418\uace0, \ubc88\uc2dd \uc8fc\uae30\uac00 \uc9e7\uc544\uc9c0\uba70, \ud1a0\uc885\ub2ed\uc758 \uc721\uc9c8\ub3c4 e\uc774\ub2e4.\ub204\uc5d0\uc288\uc5b4\ub2ed\uc758 \uc77c\uc885\uc73c\ub85c, \ubb18\uc0ac\ub41c \ubc29\ubc95\uc740 \ub2e4\uc74c\uacfc \uac19\uc740 \ub2e8\uacc4\ub85c \uad6c\uc131\ub41c\ub2e4: \uff091select\uc120\uc815\uc7ac\ubc30\uc7a5: \uac00\ucd95\uacfc \uac00\uae08\ub958 \uac70\ub798\uc7a5\uc18c\ub85c\ubd80\ud130\uc758 \uc120\ud0dd, \ub3c4\ucd95\uc7a5, \ud654\ud559\uacf5\uc7a5, \uc4f0\ub808\uae30 \ucc98\ub9ac\uc7a5, \uacf5\uae30, \uba3c\uc9c0, \uc218\uc6d0, \uc138\uade0, \uadf8\ub9ac\uace0 \uc232\uc758 \ubb34\uade0 \uc7ac\ubc30\uc774\uc138\uc624\uc5fc, \uc232\uc758 \ub2ec\uc758 \ub36e\uc784\ub3c4\ub294 60~65%\uc774\uace0, \ub192\uc740 \ub098\uc77c\ub860 \uadf8\ubb3c\ub9dd \uba74\uc801 1.8~2.2m\ub294 \ub2ed \ub180\uc774\ud130\uc758 5~8mu\uc774\uba70, \uc232 \uc18d\uc5d0 \ucc44\uc18c \uc528\uc557\uc744 \ubfcc\ub9ac\ub294 \uac83\uc744 \ubc29\uc1a1\ud55c\ub2e4. \uc2e0\uccb4\uc801 \ud2b9\uc9d5\uc774 \ub192\uace0 \uac74\uc870\ud558\uba70 \ubc30\uc218 \ube14\ub85c\uc6b0\ub2e4\uc6b4 \uc870\uac74\uc774 \uc88b\ub2e4, \uadf8\ub9ac\uace0 \ud658\uae30 \ub09c\ubc29\uc740 \ub2ed\uc9d1 \uc7a5\ube44\uc5d0 \uc124\uc815\ub41c\ub2e4, \ub2ed\uc9d1\uc740 \uc790\ub3d9 \uc0b4\uade0 \uc2dc\uc2a4\ud15c\uc744 \uc124\uc815\ud558\uae30 \uc704\ud55c \ub0b4\ubd80\uc774\ub2e4;33selectselect cind;select codelt it's \uad11\ubc94\uc704\ud558\uace0, \uc720\uc5f0\ud558\uac8c \uc791\uc6a9\ud558\uba70, \uc74c\uc2dd\uc744 \ucc3e\ub294 \ud798\uc774 \uac15\ud55c \uc21c\uc218\ud55c \ud1a0\uc885, \uc608\uac10\uc774 \uac15\ud558\ub2e4;select4aary \uad00\ub9ac:\uc784\uc57c\uc758 \ubaa8\ub4e0 \ubba4\ub294 260~280\ubc88\uc2dd\uc744 \ud558\uae30\uc5d0 \uc801\ud569\ud55c \uc7a5\uc18c\uc5d0 \ubc30\uce58\ud558\uace0, \ub9e4\uc77c \uc544\uce68 8:00~15:\uc0ac\ub8cc\uc640 \ubb3c\uc740 00\ubc88\uc2dd\uc744 \ud560 \ub54c \uc548\uc815\uc801\uc778 \ubc88\uc2dd\uc9c0\ub85c \ud22c\uc785\ud558\uace0, 17:00~18:\uc74c\uc545\uc9d1 \uc678\ubd80\uc758 \uc0ac\ub8cc\ub4e4\uc774 30\ubc88 \uc904\uc5d0 \ub4e4\uc5b4\uc11c\uba74 \uc7ac\uc0dd\ub429\ub2c8\ub2e4.\ub2ed\uc758 \uad70\uc9d1 \ubcf5\uadc0\ub294 \uc544\uc774\ub514\ucf00\uc774\uc158\uc73c\ub85c, \ub0ae \uae30\uc628\uc740 \uc6d0\uc8fc 20~23\ub3c4, \uc57c\ud589\uc131 \uc628\ub3c4\ub294 20~23\ub3c4\ub97c \uc720\uc9c0\ud558\uba70, \u30105\u3011\uc911\uc694\ud55c \uad00\ub9ac:\ub2ed\uc9d1 \uccad\uc18c\ub294 \ubc18\ub4dc\uc2dc \uc2dc\uc791\ud558\uba70, \ub2ed\uc740 3\ub144\ub9c8\ub2e4 \uc624\ud6c4\ub9c8\ub2e4 \ubc88\uc2dd\ud558\uae30\uc5d0 \uc801\ud569\ud55c \uc7a5\uc18c\uc5d0 \ub454\ub2e4.12~15\uc77c \uc774\ub0b4\uc5d0\ub294 5\uc77c \uc774\ub0b4 \ub3d9\uc801\uc18c\ub3c5\uc2dc\uc2a4\ud15c\uc774 \uba78\uade0 \ucc98\ub9ac\ub418\uba70, \uc232\uc18d\uc758 \uc8fc\ud1b5\ub85c\ub85c 2~3\uac1c\uc6d4\ub9c8\ub2e4 \ub77c\uc784\ud30c\uc6b0\ub354\uac00 \ud22c\uc785\ub41c\ub2e4.",
"from": "nayohan/aihub-en-ko-translation-12m",
"category": "full"
},
"truncated_cells": []
}
]
| +* Loss: [CachedMultipleNegativesRankingLoss](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters: + ```json + { + "scale": 20.0, + "similarity_fct": "cos_sim" + } + ``` + +### Evaluation Dataset + +#### query-to-dataset-viewer-descriptions + +* Dataset: [query-to-dataset-viewer-descriptions](https://huggingface.co/datasets/davanstrien/query-to-dataset-viewer-descriptions) at [eb9d1be](https://huggingface.co/datasets/davanstrien/query-to-dataset-viewer-descriptions/tree/eb9d1becf412659d97049b5895ad5521f8015383) +* Size: 1,433 evaluation samples +* Columns: query, positive, and negative +* Approximate statistics based on the first 1000 samples: + | | query | positive | negative | + |:--------|:-----------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------| + | type | string | string | string | + | details |
  • min: 10 tokens
  • mean: 11.75 tokens
  • max: 17 tokens
|
  • min: 46 tokens
  • mean: 2188.82 tokens
  • max: 8192 tokens
|
  • min: 33 tokens
  • mean: 1769.64 tokens
  • max: 8192 tokens
| +* Samples: + | query | positive | negative | + |:--------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| + | USER_QUERY: multimodal conversation dataset | HUB_DATASET_PREVIEW: DATASET_NAME: "BUAADreamer/llava-en-zh-2k"
FEATURES: {'messages': [{'role': {'dtype': 'string', '_type': 'Value'}, 'content': {'dtype': 'string', '_type': 'Value'}}], 'images': {'feature': {'_type': 'Image'}, '_type': 'Sequence'}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"messages": [
{
"role": "user",
"content": "How many baseball players are visible in the image?"
},
{
"role": "assistant",
"content": "There are three baseball players visible in the image."
},
{
"role": "user",
"content": "What are the players holding in their hands?"
},
{
"role": "assistant",
"content": "The players are holding baseball bats in their hands."
},
{
"role": "user",
"content": "Are the players in a dugout?"
},
{
"role": "assistant",
"content": "Yes, the three baseball players are standing in the dugout."
},
{
"role": "user",
"content": "Is the image in color or black and white?"
},
{
"role": "assistant",
"content": "The image is an old black and white photo of the three baseball players."
},
{
"role": "user",
"content": "Do the players belong to a specific baseball team?"
},
{
"role": "assistant",
"content": "Yes, the players belong to the Boston Red Sox baseball team."
}
],
"images": [
{
"src": "https://datasets-server.huggingface.co/assets/BUAADreamer/llava-en-zh-2k/--/fba994c834822bddd3cd79e929c33135f4289d2b/--/en/train/0/images/image-1d100e9.jpg?Expires=1726591851&Signature=QHGD147HyWamORfSjz0QoG51Ru86g3STPBNDAEOLK7NTq8Y~b4vVt3u~XF9njlRWwNnVF7AQ8-l9f2pCWxggPnZw1wZEfBAC5Q1oOW2CwT-gMYME~I-9qeJrbQtkszer9U0-H5rkECK0DVgWKKIagyjjJBmSSLz2QAb3BlV5aEMGEbJj-4G1X2xzoVaMIfg~ogGSb6~2PAKoLoKMPUZIxrm~2LoPEGVJeXr7JIa8-ISVukshW8ZEDQII~eXfOBeEsTd1Mk5BpJz26dH3W7X2rdhq54KeQ8H9OBt~HlU0XbXigFABeMymb-GBH-fDNHUDmb9Zt3io5hBMOd7I64DLyQ__&Key-Pair-Id=K3EI6M078Z3AC3",
"height": 308,
"width": 384
}
]
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"messages": [
{
"role": "user",
"content": "What is the man wearing in the image?"
},
{
"role": "assistant",
"content": "The man in the image is wearing chaps, a cowboy hat, and is dressed like a cowboy."
},
{
"role": "user",
"content": "What is the cowboy doing in the image?"
},
{
"role": "assistant",
"content": "The cowboy is riding on a horse in a grassy field and pointing into the distance, likely directing attention towards a specific location or object."
},
{
"role": "user",
"content": "What kind of landscape surrounds the cowboy and his horse?"
},
{
"role": "assistant",
"content": "The cowboy and his horse are surrounded by a grassy field, which creates a natural and open setting for the scene."
},
{
"role": "user",
"content": "What is the significance of the cowboy attire and its relation to horse riding?"
},
{
"role": "assistant",
"content": "Cowboy attire is deeply rooted in the history, culture, and traditions of the American West. It is both functional and symbolic, representing the hardworking and rugged lifestyle of cowboys who spent much of their time working with livestock and riding horses.\n\nThe key elements of cowboy attire, like the cowboy hat and chaps, serve specific purposes:\n1. Cowboy Hat: A wide-brimmed hat made of stiffened felt or straw is designed to protect the cowboy from the sun, wind, and rain. The hat's distinctive high crown allows for air circulation and provides a touch of style.\n2. Chaps: These are sturdy leather leg coverings worn over pants to protect the cowboy's legs from rough terrain, thorny bushes, and other elements while riding or working with livestock. Chaps provide an additional layer of protection and durability, particularly while riding through rugged landscapes and herding cattle.\n\nIn summary, the cowboy attire seen in the image is not just a fashion statement but is deeply connected to the history and traditions of the American West. The functional pieces of clothing, like the cowboy hat and chaps, are designed to protect and support the cowboy during horse riding and working with livestock."
}
],
"images": [
{
"src": "https://datasets-server.huggingface.co/assets/BUAADreamer/llava-en-zh-2k/--/fba994c834822bddd3cd79e929c33135f4289d2b/--/en/train/1/images/image-1d100e9.jpg?Expires=1726591851&Signature=WyNDGZXVbzPOU9iOQSDPFt1MizgmdT-KqdVAG8nIVSK0Gg8OO-qmhKxgIVjyWMHnWyNbW5svuMoukPMyv9hiHMsNh0YmzdjMR9Gwb6mRvsisEAdaLl71Q053MYxEqkZWCB6PbXG5yEazHL4RHvDphsUEhZS-0Yk8Kzx0HHc12HNaJfiO4fO4IPkY3eLw5xLgNoKIcvvO9TDo0JEbc1ej6YkxGUdqXyVrG2Y4zYnhrCM0drgKVzq24cQ9YZ78HW5f-EsXsftbj0ZzEg4SKcuVgrqaKG8SJ~i0aV-OtkXiTCWxW16D4hfsmpXZShZAHesa1EOGprkYdtQG4Kfte12maQ__&Key-Pair-Id=K3EI6M078Z3AC3",
"height": 288,
"width": 384
}
]
},
"truncated_cells": []
}
]
| NEGATIVE: DATASET_NAME: "passing2961/photochat_plus"
FEATURES: {'photo_description': {'dtype': 'string', '_type': 'Value'}, 'trigger_sentences': {'feature': {'dtype': 'string', '_type': 'Value'}, '_type': 'Sequence'}, 'dialogue_id': {'dtype': 'int64', '_type': 'Value'}, 'photo_url': {'dtype': 'string', '_type': 'Value'}, 'dialogue': [{'message': {'dtype': 'string', '_type': 'Value'}, 'share_photo': {'dtype': 'bool', '_type': 'Value'}, 'user_id': {'dtype': 'int64', '_type': 'Value'}}], 'image_descriptions': {'feature': {'dtype': 'string', '_type': 'Value'}, '_type': 'Sequence'}, 'intents': {'feature': {'dtype': 'string', '_type': 'Value'}, '_type': 'Sequence'}, 'salient_information': {'feature': {'dtype': 'string', '_type': 'Value'}, '_type': 'Sequence'}, 'photo_id': {'dtype': 'string', '_type': 'Value'}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"photo_description": "The photo has your brother Kannon. Objects in the photo: Man",
"trigger_sentences": [
"How is Kannon doing?"
],
"dialogue_id": 500,
"photo_url": "https://farm6.staticflickr.com/151/369716968_bde7e83418_o.jpg",
"dialogue": [
{
"message": "Hello, how have you been, dear friend?",
"share_photo": false,
"user_id": 1
},
{
"message": "Great!",
"share_photo": false,
"user_id": 0
},
{
"message": "Thanks for asking",
"share_photo": false,
"user_id": 0
},
{
"message": "And how have you been?",
"share_photo": false,
"user_id": 0
},
{
"message": "It seems like we haven't talked in forever",
"share_photo": false,
"user_id": 0
},
{
"message": "I have been doing well, keeping busy, spent a lot of time outdoors. What have you been up to?",
"share_photo": false,
"user_id": 1
},
{
"message": "Last night my brother Kannon did a poetry reading",
"share_photo": false,
"user_id": 0
},
{
"message": "Really? How did it go? You know how much I love poetry.",
"share_photo": false,
"user_id": 1
},
{
"message": "It went really well",
"share_photo": false,
"user_id": 0
},
{
"message": "Do you remember my brother Kannon?",
"share_photo": false,
"user_id": 0
},
{
"message": "Absolutely! How could I forget, he left quite an impression",
"share_photo": false,
"user_id": 1
},
{
"message": "How is Kannon doing?",
"share_photo": false,
"user_id": 1
},
{
"message": "",
"share_photo": true,
"user_id": 0
},
{
"message": "Great",
"share_photo": false,
"user_id": 0
},
{
"message": "Here is a photo from last night",
"share_photo": false,
"user_id": 0
},
{
"message": "Wow, he seems so confident in that pic! Wish that I could have been there.",
"share_photo": false,
"user_id": 1
}
],
"image_descriptions": [
"A photo of Kannon",
"A picture of Kannon.",
"a photo of recent situation"
],
"intents": [
"Information Dissemination",
"Social Bonding"
],
"salient_information": [
"poetry",
"How is Kannon doing?",
"Kannon doing"
],
"photo_id": "train/19e8f436d4b2fc25"
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"photo_description": "The photo has your uncle Kieran. Objects in the photo: Clothing, Man",
"trigger_sentences": [
"guess what new animal he got?",
"He's always had goats and chickens, but guess what new animal he got?"
],
"dialogue_id": 501,
"photo_url": "https://farm8.staticflickr.com/53/189664134_f70fc8947a_o.jpg",
"dialogue": [
{
"message": "Hey! You remember my uncle who owns the hobby farm, right?",
"share_photo": false,
"user_id": 0
},
{
"message": "Yeah i do",
"share_photo": false,
"user_id": 1
},
{
"message": "Uncle Keiran?",
"share_photo": false,
"user_id": 0
},
{
"message": "How about him?",
"share_photo": false,
"user_id": 1
},
{
"message": "He's always had goats and chickens, but guess what new animal he got?",
"share_photo": false,
"user_id": 0
},
{
"message": "Dog?",
"share_photo": false,
"user_id": 1
},
{
"message": "Nope, a wild hog!",
"share_photo": false,
"user_id": 0
},
{
"message": "And not the motorcycle kind ;)",
"share_photo": false,
"user_id": 0
},
{
"message": "",
"share_photo": true,
"user_id": 0
},
{
"message": "Wow",
"share_photo": false,
"user_id": 1
}
],
"image_descriptions": [
"A photo of the hog's appearance.",
"a photo of wild hog",
"An image of the new wild hog"
],
"intents": [
"Social Bonding",
"Visual Clarification"
],
"salient_information": [
"hog",
"not the motorcycle kind",
"wild hog",
"a wild hog"
],
"photo_id": "train/07d688f5e2142b87"
},
"truncated_cells": []
}
]
| + | USER_QUERY: question answering dataset for technical documentation | HUB_DATASET_PREVIEW: DATASET_NAME: "hythyt/preguntas-respuestas-RAG"
FEATURES: {'DATOS': {'dtype': 'string', '_type': 'Value'}, 'PREGUNTA': {'dtype': 'string', '_type': 'Value'}, 'RESPUESTA': {'dtype': 'string', '_type': 'Value'}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"DATOS": "ARISbyhyt Manual de Usuario\nSecci\u00f3n 8\nARISbyhyt es un programa dise\u00f1ado para gestionar proyectos de forma eficiente. Permite crear tareas, asignar recursos y realizar un seguimiento detallado del progreso. Para agregar una nueva tarea, accede al men\u00fa \"Proyectos\" y selecciona \"Nueva Tarea\". Completa la informaci\u00f3n requerida, como nombre, fecha de inicio y asignaci\u00f3n de recursos.\n\nSecci\u00f3n 4\nEn ARISbyhyt, puedes personalizar tus informes para obtener datos espec\u00edficos sobre el progreso del proyecto. Accede al men\u00fa \"Informes\", elige la opci\u00f3n \"Personalizar Informe\" y selecciona los campos que deseas incluir, como el estado de las tareas o el tiempo dedicado a cada fase.",
"PREGUNTA": "\u00bfC\u00f3mo puedo crear una nueva tarea en ARISbyhyt?",
"RESPUESTA": "Para agregar una nueva tarea en ARISbyhyt, sigue estos pasos:\n1. Ve al men\u00fa \"Proyectos\".\n2. Selecciona \"Nueva Tarea\".\n3. Completa la informaci\u00f3n solicitada, como nombre, fecha de inicio y asignaci\u00f3n de recursos."
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"DATOS": "ARISbyhyt Manual de Usuario \nSecci\u00f3n 2\nEn ARISbyhyt, puedes personalizar tus informes para obtener datos espec\u00edficos sobre el progreso del proyecto. Accede al men\u00fa \"Informes\", elige la opci\u00f3n \"Personalizar Informe\" y selecciona los campos que deseas incluir, como el estado de las tareas o el tiempo dedicado a cada fase.",
"PREGUNTA": "\u00bfC\u00f3mo puedo personalizar un informe en ARISbyhyt para obtener datos espec\u00edficos sobre el progreso del proyecto?",
"RESPUESTA": "Para personalizar un informe en ARISbyhyt, sigue estos pasos:\n1. Dir\u00edgete al men\u00fa \"Informes\".\n2. Selecciona \"Personalizar Informe\".\n3. Elige los campos que deseas incluir, como el estado de las tareas o el tiempo dedicado a cada fase."
},
"truncated_cells": []
}
]
| NEGATIVE: DATASET_NAME: "cmalaviya/expertqa"
FEATURES: {'example_id': {'dtype': 'int64', '_type': 'Value'}, 'context': {'dtype': 'string', '_type': 'Value'}, 'question': {'dtype': 'string', '_type': 'Value'}, 'answer': {'dtype': 'string', '_type': 'Value'}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"example_id": 0,
"context": "",
"question": "Some customers of mine are not paying their debts on time. Do I have to keep all my customers?",
"answer": "You don't necessarily have to keep all your customers, especially if they consistently fail to pay their debts on time. There are different types of non-paying customers, such as cash-strapped, purposefully late, and non-payer by nature . It is essential to maintain a positive attitude and treat your customers with respect while trying to collect their debts . However, if you consistently face issues with particular customers not paying their debts, you may opt to discontinue providing services or products to them and focus on other reliable customers. You may need to consult a professional debt collector or a business attorney in such cases to decide the appropriate next steps in debt collections . To prevent nonpayment issues in the future, you can implement various strategies, such as researching new prospects, being clear with your payment policies, and setting up contracts detailing payment expectations and late fees ."
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"example_id": 1,
"context": "",
"question": "When accounts are faced with ethical dilemmas that often bring their integrity into question, the question is whether they are equipped enough tp deal with those?",
"answer": " The context provided does not give specific information on whether accountants are adequately equipped to handle ethical dilemmas that could question their integrity. The text does suggest, however, that when faced with an ethical dilemma, one must question the situation honestly and transparently. And, if doubts persist, they have the obligation to raise these questions with those in authority . This suggests the need for a strong understanding of ethics to navigate such situations. The text also implies a correlation between integrity and ethics stating, \"Integrity can be measured by ethics\" . In a broader perspective, the text suggests that professionals, like nurses for example, often face dilemmas uncommon to the general populace . Due to the rapid advancement in medical technology, the study of ethics has become increasingly relevant, indicating that equipping professionals with adequate knowledge in ethics is necessary to navigate the demands and challenges of their roles effectively . Furthermore, it shows that managers grapple with ethical decisions involving questions of morality and integrity especially in situations where prior decisions by other managers create ethical dilemmas . While this analysis provides general insights on the significance of ethical decision-making and the need for professionals to navigate ethical dilemmas effectively, it does not provide a specific commentary on the readiness or the adequacy of training or framework available to accountants to deal with such scenarios. Hence, it is not possible to definitively answer the question based on the context provided. In South Africa SAICA has equipped accountants with code of professional conduct that they should follow when faced with ethical dilemmas. the code gives them guidance on how to deal with those. SAICA code of professional conduct https://www.misti.com/internal-audit-insights/ethics-and-the-internal-auditor Ethics and the Internal Auditor freely express these thoughts and ideas, the culture may be sending the wrong message. When you are personally faced with an ethical dilemma, you must ask yourself whether you are looking at the situation as honestly and transparently as you can. If questions still arise, it is your obligation to raise those questions to individuals in positions of responsibility. Integrity can be measured by ethics If someone had you name the top three people in history that you felt displayed unquestionable integrity, would those same individuals measure high on the ethics scale? Most likely they would. Integrity is adherence to https://misti.com/internal-audit-insights/ethics-and-the-internal-auditor Ethics and the Internal Auditor thoughts and ideas, the culture may be sending the wrong message. When you are personally faced with an ethical dilemma, you must ask yourself whether you are looking at the situation as honestly and transparently as you can. If questions still arise, it is your obligation to raise those questions to individuals in positions of responsibility. Integrity can be measured by ethics If someone had you name the top three people in history that you felt displayed unquestionable integrity, would those same individuals measure high on the ethics scale? Most likely they would. Integrity is adherence to a moral code, https://www.misti.co.uk/internal-audit-insights/ethics-and-the-internal-auditor Ethics and the Internal Auditor wrong message. When you are personally faced with an ethical dilemma, you must ask yourself whether you are looking at the situation as honestly and transparently as you can. If questions still arise, it is your obligation to raise those questions to individuals in positions of responsibility. Integrity can be measured by ethics If someone had you name the top 3 people in history that you felt displayed unquestionable integrity, would those same individuals measure high on the ethics scale? Most likely they would. Integrity is adherence to a moral code, reflected in honesty and harmony in what one thinks, SAICA equip accountants with all the relevant information in order to be able to identify ethical dilemmas https://www.misti.com/internal-audit-insights/ethics-and-the-internal-auditor Ethics and the Internal Auditor freely express these thoughts and ideas, the culture may be sending the wrong message. When you are personally faced with an ethical dilemma, you must ask yourself whether you are looking at the situation as honestly and transparently as you can. If questions still arise, it is your obligation to raise those questions to individuals in positions of responsibility. Integrity can be measured by ethics If someone had you name the top three people in history that you felt displayed unquestionable integrity, would those same individuals measure high on the ethics scale? Most likely they would. Integrity is adherence to https://misti.com/internal-audit-insights/ethics-and-the-internal-auditor Ethics and the Internal Auditor thoughts and ideas, the culture may be sending the wrong message. When you are personally faced with an ethical dilemma, you must ask yourself whether you are looking at the situation as honestly and transparently as you can. If questions still arise, it is your obligation to raise those questions to individuals in positions of responsibility. Integrity can be measured by ethics If someone had you name the top three people in history that you felt displayed unquestionable integrity, would those same individuals measure high on the ethics scale? Most likely they would. Integrity is adherence to a moral code, https://www.misti.co.uk/internal-audit-insights/ethics-and-the-internal-auditor Ethics and the Internal Auditor wrong message. When you are personally faced with an ethical dilemma, you must ask yourself whether you are looking at the situation as honestly and transparently as you can. If questions still arise, it is your obligation to raise those questions to individuals in positions of responsibility. Integrity can be measured by ethics If someone had you name the top 3 people in history that you felt displayed unquestionable integrity, would those same individuals measure high on the ethics scale? Most likely they would. Integrity is adherence to a moral code, reflected in honesty and harmony in what one thinks, https://www.bartleby.com/essay/The-Ethical-Dilemma-Of-A-Family-Nurse-F3H66JS4CPLLX The Ethical Dilemma Of A Family Nurse Practitioner | Bartleby or external factors. Due to the increased complexity of the health system, nowadays nurses are faced with ethical and legal decisions and often come across dilemmas regarding patient care. From this perspective a good question to be raised would be whether or not nurses have the necessary background, knowledge and skills to make appropriate Ethics : Ethics And Ethics professionals who often face dilemmas that are not experienced by the general population. The fast-paced growth of medical technology has made the study of ethics even more relevant. The study of bioethics, or biomedical ethics, refers to moral dilemmas due to https://www.bartleby.com/essay/The-Ethical-Dilemma-Of-A-Family-Nurse-F3H66JS4CPLLX The Ethical Dilemma Of A Family Nurse Practitioner | Bartleby or external factors. Due to the increased complexity of the health system, nowadays nurses are faced with ethical and legal decisions and often come across dilemmas regarding patient care. From this perspective a good question to be raised would be whether or not nurses have the necessary background, knowledge and skills to make appropriate Ethics : Ethics And Ethics professionals who often face dilemmas that are not experienced by the general population. The fast-paced growth of medical technology has made the study of ethics even more relevant. The study of bioethics, or biomedical ethics, refers to moral dilemmas due to http://projectmanager.com.au/ethics-governance-individual-project-manager/ \u00bb Ethics, governance and the individual project manager Project Manager if the payments were not made and as a consequence the mine was not adequately protected leading to equipment damaged and mine workers being attacked and injured, who would be to blame? The managers were faced with a classic dilemma. At the personal level, making the right ethical decisions can be difficult and involve questions of morality and integrity. Your decision becomes more difficult when it involves dealing with a situation created by earlier decisions made by other managers working for the organisation. The managers who decided to make payments to Indonesian officials that may contravene the Bribery of Foreign SAICA equip accountants with all the relevant information in order to be able to identify ethical dilemmas SAICA equip accountants with all the relevant information in order to be able to identify ethical dilemmas"
},
"truncated_cells": []
}
]
| + | USER_QUERY: conversational ai training dataset | HUB_DATASET_PREVIEW: DATASET_NAME: "dataautogpt3/flan1m-alpaca-uncensored"
FEATURES: {'instruction': {'dtype': 'string', '_type': 'Value'}, 'input': {'dtype': 'string', '_type': 'Value'}, 'output': {'dtype': 'string', '_type': 'Value'}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"instruction": "You are a helpful assistant, who always provide explanation. Think like you are answering to a five year old.",
"input": "Continue the following story.\n\nEmily held and rocked Kayla as they sobbed because \nSelect from the following.\n +Emily really needed a good friend.\n +Kayla really needed a good friend.",
"output": "Once upon a time, Emily held and rocked Kayla as they both sobbed because Kayla really needed a good friend. You see, little Kayla was feeling very lonely and sad because she didn't have anyone to play with or talk to.\n\nEmily wanted to help Kayla and be her friend, so she did something truly special. Emily decided to throw a magical friendship party for Kayla. Emily knew that this would cheer her up and help her make some new friends!\n\nEmily worked really hard, preparing for the big day. She bought balloons, baked cookies, and even created a treasure hunt for everyone to enjoy. She invited lots of other kids from their neighborhood, so Kayla could meet them all and find a friend to play with.\n\nOn the day of the party, the sun was shining, and the sky was filled with fluffy clouds. All the kids in the neighborhood were excited to come to the party and play with Kayla.\n\nEmily and Kayla welcomed their guests with huge smiles and warm hugs. The kids played games, laughed, and explored the treasure hunt that Emily had set up all around the backyard.\n\nSoon, Kayla was surrounded by new friends, chatting and laughing together. She began to feel so happy and loved. As the sun began to set and the party ended, Kayla felt like she finally had found the friends she needed.\n\nThat night, as Emily tucked Kayla into bed, Kayla thanked her for being such an amazing friend and throwing the best party ever. Emily smiled and said, \"That's what friends are for, to help each other when we're feeling sad and lonely.\"\n\nAnd from that day on, Emily and Kayla were the best of friends and shared many more magical adventures together. Now, Kayla knew that she always had a good friend by her side."
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"instruction": "You are an AI assistant that helps people find information. User will you give you a question. Your task is to answer as faithfully as you can. While answering think step-bystep and justify your answer.",
"input": "For this chain-of-thought reasoning and answer, what was the question?\nA boy playing ball is not necessarily not very good at playing ball. There is no way to know what level of skill the boy has.\n A: it is not possible to tell",
"output": "The question likely was:\n\n\"Does a boy playing ball automatically mean he is not very good at playing ball?\""
},
"truncated_cells": []
}
]
| NEGATIVE: DATASET_NAME: "sgp-bench/open-instruct-gpt4o_40k"
FEATURES: {'instruction': {'dtype': 'string', '_type': 'Value'}, 'input': {'dtype': 'string', '_type': 'Value'}, 'output': {'dtype': 'string', '_type': 'Value'}}
DATA SAMPLE:
[
{
"row_idx": 0,
"row": {
"instruction": "Can you tell me how cookies work for tracking user data?",
"input": "",
"output": "Sure! Cookies are small pieces of data that websites can store on your computer, allowing them to track your activities and improve the user experience. For example, cookies can remember your preferences, keep track of your shopping history, or save information you enter into forms. They\u2019re typically stored in a file on your hard drive, and you can configure your browser to block or delete cookies, or\n"
},
"truncated_cells": []
},
{
"row_idx": 1,
"row": {
"instruction": "Given the sentence \"White fluffy dog hangs out of the window of a turquoise building.\" is it true that \"The dog is indoors.\"? Let's be accurate as possible and think first.",
"input": "",
"output": "A dog being in a turquoise building is the same as a dog being indoors. Final answer: yes."
},
"truncated_cells": []
}
]
| +* Loss: [CachedMultipleNegativesRankingLoss](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters: + ```json + { + "scale": 20.0, + "similarity_fct": "cos_sim" + } + ``` + +### Training Hyperparameters +#### Non-Default Hyperparameters + +- `eval_strategy`: steps +- `per_device_train_batch_size`: 4 +- `per_device_eval_batch_size`: 4 +- `num_train_epochs`: 200 +- `warmup_ratio`: 0.1 +- `bf16`: True +- `load_best_model_at_end`: True +- `batch_sampler`: no_duplicates + +#### All Hyperparameters +
Click to expand + +- `overwrite_output_dir`: False +- `do_predict`: False +- `eval_strategy`: steps +- `prediction_loss_only`: True +- `per_device_train_batch_size`: 4 +- `per_device_eval_batch_size`: 4 +- `per_gpu_train_batch_size`: None +- `per_gpu_eval_batch_size`: None +- `gradient_accumulation_steps`: 1 +- `eval_accumulation_steps`: None +- `torch_empty_cache_steps`: None +- `learning_rate`: 5e-05 +- `weight_decay`: 0.0 +- `adam_beta1`: 0.9 +- `adam_beta2`: 0.999 +- `adam_epsilon`: 1e-08 +- `max_grad_norm`: 1.0 +- `num_train_epochs`: 200 +- `max_steps`: -1 +- `lr_scheduler_type`: linear +- `lr_scheduler_kwargs`: {} +- `warmup_ratio`: 0.1 +- `warmup_steps`: 0 +- `log_level`: passive +- `log_level_replica`: warning +- `log_on_each_node`: True +- `logging_nan_inf_filter`: True +- `save_safetensors`: True +- `save_on_each_node`: False +- `save_only_model`: False +- `restore_callback_states_from_checkpoint`: False +- `no_cuda`: False +- `use_cpu`: False +- `use_mps_device`: False +- `seed`: 42 +- `data_seed`: None +- `jit_mode_eval`: False +- `use_ipex`: False +- `bf16`: True +- `fp16`: False +- `fp16_opt_level`: O1 +- `half_precision_backend`: auto +- `bf16_full_eval`: False +- `fp16_full_eval`: False +- `tf32`: None +- `local_rank`: 0 +- `ddp_backend`: None +- `tpu_num_cores`: None +- `tpu_metrics_debug`: False +- `debug`: [] +- `dataloader_drop_last`: False +- `dataloader_num_workers`: 0 +- `dataloader_prefetch_factor`: None +- `past_index`: -1 +- `disable_tqdm`: False +- `remove_unused_columns`: True +- `label_names`: None +- `load_best_model_at_end`: True +- `ignore_data_skip`: False +- `fsdp`: [] +- `fsdp_min_num_params`: 0 +- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False} +- `fsdp_transformer_layer_cls_to_wrap`: None +- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None} +- `deepspeed`: None +- `label_smoothing_factor`: 0.0 +- `optim`: adamw_torch +- `optim_args`: None +- `adafactor`: False +- `group_by_length`: False +- `length_column_name`: length +- `ddp_find_unused_parameters`: None +- `ddp_bucket_cap_mb`: None +- `ddp_broadcast_buffers`: False +- `dataloader_pin_memory`: True +- `dataloader_persistent_workers`: False +- `skip_memory_metrics`: True +- `use_legacy_prediction_loop`: False +- `push_to_hub`: False +- `resume_from_checkpoint`: None +- `hub_model_id`: None +- `hub_strategy`: every_save +- `hub_private_repo`: False +- `hub_always_push`: False +- `gradient_checkpointing`: False +- `gradient_checkpointing_kwargs`: None +- `include_inputs_for_metrics`: False +- `eval_do_concat_batches`: True +- `fp16_backend`: auto +- `push_to_hub_model_id`: None +- `push_to_hub_organization`: None +- `mp_parameters`: +- `auto_find_batch_size`: False +- `full_determinism`: False +- `torchdynamo`: None +- `ray_scope`: last +- `ddp_timeout`: 1800 +- `torch_compile`: False +- `torch_compile_backend`: None +- `torch_compile_mode`: None +- `dispatch_batches`: None +- `split_batches`: None +- `include_tokens_per_second`: False +- `include_num_input_tokens_seen`: False +- `neftune_noise_alpha`: None +- `optim_target_modules`: None +- `batch_eval_metrics`: False +- `eval_on_start`: False +- `eval_use_gather_object`: False +- `batch_sampler`: no_duplicates +- `multi_dataset_batch_sampler`: proportional + +
+ +### Training Logs +| Epoch | Step | Training Loss | loss | max_accuracy | +|:----------:|:--------:|:-------------:|:----------:|:------------:| +| 0 | 0 | - | - | 0.5 | +| 0.3497 | 100 | 1.0509 | 0.7070 | - | +| 0.6993 | 200 | 0.6183 | 0.3396 | - | +| 1.0490 | 300 | 0.3746 | 0.2282 | - | +| 1.3986 | 400 | 0.2481 | 0.1616 | - | +| 1.7483 | 500 | 0.2198 | 0.1302 | - | +| 2.0979 | 600 | 0.166 | 0.1164 | - | +| 2.4476 | 700 | 0.1045 | 0.1174 | - | +| 2.7972 | 800 | 0.0797 | 0.1095 | - | +| 3.1469 | 900 | 0.0422 | 0.1176 | - | +| 3.4965 | 1000 | 0.0595 | 0.1115 | - | +| 3.8462 | 1100 | 0.0416 | 0.1008 | - | +| 4.1958 | 1200 | 0.0174 | 0.1233 | - | +| 4.5455 | 1300 | 0.0273 | 0.1032 | - | +| 4.8951 | 1400 | 0.0389 | 0.0990 | - | +| **5.2448** | **1500** | **0.0126** | **0.0963** | **-** | +| 5.5944 | 1600 | 0.0074 | 0.1193 | - | +| 5.9441 | 1700 | 0.0165 | 0.1379 | - | +| 6.2937 | 1800 | 0.0046 | 0.1127 | - | +| 6.6434 | 1900 | 0.0158 | 0.1289 | - | +| 6.9930 | 2000 | 0.0157 | 0.1009 | - | +| 7.3427 | 2100 | 0.0032 | 0.1075 | - | +| 7.6923 | 2200 | 0.0072 | 0.1289 | - | +| 8.0420 | 2300 | 0.0192 | 0.1176 | - | +| 8.3916 | 2400 | 0.001 | 0.1214 | - | +| 8.7413 | 2500 | 0.024 | 0.1320 | 1.0 | + +* The bold row denotes the saved checkpoint. + +### Framework Versions +- Python: 3.10.12 +- Sentence Transformers: 3.1.0 +- Transformers: 4.44.2 +- PyTorch: 2.4.0+cu121 +- Accelerate: 0.34.2 +- Datasets: 3.0.0 +- Tokenizers: 0.19.1 + +## Citation + +### BibTeX + +#### Sentence Transformers +```bibtex +@inproceedings{reimers-2019-sentence-bert, + title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks", + author = "Reimers, Nils and Gurevych, Iryna", + booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing", + month = "11", + year = "2019", + publisher = "Association for Computational Linguistics", + url = "https://arxiv.org/abs/1908.10084", +} +``` + +#### CachedMultipleNegativesRankingLoss +```bibtex +@misc{gao2021scaling, + title={Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup}, + author={Luyu Gao and Yunyi Zhang and Jiawei Han and Jamie Callan}, + year={2021}, + eprint={2101.06983}, + archivePrefix={arXiv}, + primaryClass={cs.LG} +} +``` + + + + + + \ No newline at end of file