Le champ SQL SELECT WHERE contient des mots

Question

J'ai besoin d'un select qui retournerait des résultats comme ceci:

SELECT * FROM MyTable WHERE Column1 CONTAINS 'Word1 Word2 Word3'

Et j’ai besoin de tous les résultats, c’est-à-dire que cela inclut les chaînes avec 'Word2 Word3 Word1' ou 'Word1 Word3 Word2' ou toute autre combinaison des trois.

Tous les mots doivent figurer dans le résultat.

mvp · Accepted Answer

Plutôt lent, mais méthode de travail pour inclure n'importe quel de mots:

SELECT * FROM mytable WHERE column1 LIKE '%Word1%' OR column1 LIKE '%Word2%' OR column1 LIKE '%Word3%'

Si vous avez besoin de tous mots, utilisez ceci:

SELECT * FROM mytable WHERE column1 LIKE '%Word1%' AND column1 LIKE '%Word2%' AND column1 LIKE '%Word3%'

Si vous voulez quelque chose de plus rapide, vous devez examiner la recherche en texte intégral, ce qui est très spécifique pour chaque type de base de données.

Sam · Answer

Notez que si vous utilisez LIKE pour déterminer si une chaîne est une sous-chaîne d'une autre chaîne, vous devez échapper les caractères correspondant au modèle dans votre chaîne de recherche.

Si votre dialecte SQL supporte CHARINDEX, son utilisation est beaucoup plus simple:

SELECT * FROM MyTable WHERE CHARINDEX('Word1', Column1) > 0 AND CHARINDEX('Word2', Column1) > 0 AND CHARINDEX('Word3', Column1) > 0

De plus, n'oubliez pas que ceci et la méthode dans la réponse acceptée couvrent uniquement la correspondance de sous-chaîne plutôt que la correspondance de Word. Ainsi, par exemple, la chaîne 'Word1word2Word3' devrait toujours correspondre.

Eduardo Cuomo · Answer

Une fonction

 CREATE FUNCTION [dbo].[fnSplit] ( @sep CHAR(1), @str VARCHAR(512) ) RETURNS TABLE AS RETURN ( WITH Pieces(pn, start, stop) AS ( SELECT 1, 1, CHARINDEX(@sep, @str) UNION ALL SELECT pn + 1, stop + 1, CHARINDEX(@sep, @str, stop + 1) FROM Pieces WHERE stop > 0 ) SELECT pn AS Id, SUBSTRING(@str, start, CASE WHEN stop > 0 THEN stop - start ELSE 512 END) AS Data FROM Pieces )

Question

 DECLARE @FilterTable TABLE (Data VARCHAR(512)) INSERT INTO @FilterTable (Data) SELECT DISTINCT S.Data FROM fnSplit(' ', 'Word1 Word2 Word3') S -- Contains words SELECT DISTINCT T.* FROM MyTable T INNER JOIN @FilterTable F1 ON T.Column1 LIKE '%' + F1.Data + '%' LEFT JOIN @FilterTable F2 ON T.Column1 NOT LIKE '%' + F2.Data + '%' WHERE F2.Data IS NULL

messed-up · Answer

Au lieu de SELECT * FROM MyTable WHERE Column1 CONTAINS 'Word1 Word2 Word3', Add Et entre ces mots tels que:

SELECT * FROM MyTable WHERE Column1 CONTAINS 'Word1 And Word2 And Word3'

pour plus de détails, voir ici https://msdn.Microsoft.com/en-us/library/ms187787.aspx

METTRE &AGRAVE; JOUR

Pour sélectionner des phrases, utilisez des guillemets tels que:

SELECT * FROM MyTable WHERE Column1 CONTAINS '"Phrase one" And Word2 And "Phrase Two"'

p.s. vous devez d'abord activer la recherche en texte intégral sur la table avant d'utiliser le mot-clé contient. pour plus de détails, voir ici https://docs.Microsoft.com/en-us/sql/relational-databases/search/get-started-with-full-text-search

Jon Crowell · Answer

SELECT * FROM MyTable WHERE Column1 LIKE '%Word1%' AND Column1 LIKE '%Word2%' AND Column1 LIKE '%Word3%'

OR remplacé par AND en fonction des modifications apportées à la question.

mirmdasif · Answer

Si vous utilisez Oracle Database, vous pouvez y parvenir en utilisant includes query. Contient les requêtes sont plus rapides que la même requête.

Si vous avez besoin de tous les mots

SELECT * FROM MyTable WHERE CONTAINS(Column1,'Word1 and Word2 and Word3', 1) > 0

Si vous avez besoin de l'un des mots

SELECT * FROM MyTable WHERE CONTAINS(Column1,'Word1 or Word2 or Word3', 1) > 0

Contient l'index de besoin de type CONTEXT sur votre colonne.

CREATE INDEX SEARCH_IDX ON MyTable(Column) INDEXTYPE IS CTXSYS.CONTEXT

Joshua Balan · Answer

Si vous voulez juste trouver une correspondance.

SELECT * FROM MyTable WHERE INSTR('Word1 Word2 Word3',Column1)<>0

Serveur SQL :

CHARINDEX(Column1, 'Word1 Word2 Word3', 1)<>0

Pour obtenir une correspondance exacte. Exemple (';a;ab;ac;',';b;') n'obtiendra pas de correspondance.

SELECT * FROM MyTable WHERE INSTR(';Word1;Word2;Word3;',';'||Column1||';')<>0

Daryl Arenas · Answer

essayez d'utiliser la "recherche tesarus" dans l'index de texte intégral dans MS SQL Server. C'est beaucoup mieux que d'utiliser "%" dans la recherche si vous avez des millions d'enregistrements. tesarus utilise une petite quantité de mémoire que les autres . essayez de rechercher cette fonction :)

Jino · Answer

SELECT * FROM MyTable WHERE Column1 Like "*Word*"

Cela affichera tous les enregistrements où column1 a une valeur partielle contenant Word.

MiladAhmadi · Answer

le meilleur moyen est de créer un index de texte intégral sur une colonne de la table et d’utiliser le contenu au lieu de LIKE

SELECT * FROM MyTable WHERE contains(Column1 , N'Word1' ) AND contains(Column1 , N'Word2' ) AND contains(Column1 , N'Word3' )

Anastasios Selmanis · Answer

L'un des moyens les plus simples de réaliser ce qui est mentionné dans la question consiste à utiliser CONTIENT avec NEAR ou '~'. Par exemple, les requêtes suivantes nous donneraient toutes les colonnes qui incluent spécifiquement Word1, Word2 et Word3.

SELECT * FROM MyTable WHERE CONTAINS(Column1, 'Word1 NEAR Word2 NEAR Word3') SELECT * FROM MyTable WHERE CONTAINS(Column1, 'Word1 ~ Word2 ~ Word3')

En outre, CONTAINSTABLE renvoie un classement pour chaque document en fonction de la proximité de "Word1", "Word2" et "Word3". Par exemple, si un document contient la phrase "Le mot Word1 est Word2 et Word3", son classement sera élevé, car les termes sont plus proches les uns des autres que dans d'autres documents.

Une autre chose que je voudrais ajouter est que nous pouvons également utiliser proximity_term pour rechercher des colonnes dans lesquelles les mots se trouvent à une distance spécifique de ceux-ci dans la phrase de colonne.

JBelfort · Answer

Cela devrait idéalement être effectué à l'aide de la recherche en texte intégral du serveur SQL si vous utilisez . Cependant, si vous ne parvenez pas à utiliser votre base de données pour une raison quelconque, voici une solution intensive en termes de performances: -

-- table to search in CREATE TABLE dbo.myTable ( myTableId int NOT NULL IDENTITY (1, 1), code varchar(200) NOT NULL, description varchar(200) NOT NULL -- this column contains the values we are going to search in ) ON [PRIMARY] GO -- function to split space separated search string into individual words CREATE FUNCTION [dbo].[fnSplit] (@StringInput nvarchar(max), @Delimiter nvarchar(1)) RETURNS @OutputTable TABLE ( id nvarchar(1000) ) AS BEGIN DECLARE @String nvarchar(100); WHILE LEN(@StringInput) > 0 BEGIN SET @String = LEFT(@StringInput, ISNULL(NULLIF(CHARINDEX(@Delimiter, @StringInput) - 1, -1), LEN(@StringInput))); SET @StringInput = SUBSTRING(@StringInput, ISNULL(NULLIF(CHARINDEX ( @Delimiter, @StringInput ), 0 ), LEN ( @StringInput) ) + 1, LEN(@StringInput)); INSERT INTO @OutputTable (id) VALUES (@String); END; RETURN; END; GO -- this is the search script which can be optionally converted to a stored procedure /function declare @search varchar(max) = 'infection upper acute genito'; -- enter your search string here -- the searched string above should give rows containing the following -- infection in upper side with acute genitointestinal tract -- acute infection in upper teeth -- acute genitointestinal pain if (len(trim(@search)) = 0) -- if search string is empty, just return records ordered alphabetically begin select 1 as Priority ,myTableid, code, Description from myTable order by Description return; end declare @splitTable Table( wordRank int Identity(1,1), -- individual words are assinged priority order (in order of occurence/position) Word varchar(200) ) declare @nonWordTable Table( -- table to trim out auxiliary verbs, prepositions etc. from the search id varchar(200) ) insert into @nonWordTable values ('of'), ('with'), ('at'), ('in'), ('for'), ('on'), ('by'), ('like'), ('up'), ('off'), ('near'), ('is'), ('are'), (','), (':'), (';') insert into @splitTable select id from dbo.fnSplit(@search,' '); -- this function gives you a table with rows containing all the space separated words of the search like in this e.g., the output will be - -- id ------------- -- infection -- upper -- acute -- genito delete s from @splitTable s join @nonWordTable n on s.Word = n.id; -- trimming out non-words here declare @countOfSearchStrings int = (select count(Word) from @splitTable); -- count of space separated words for search declare @highestPriority int = POWER(@countOfSearchStrings,3); with plainMatches as ( select myTableid, @highestPriority as Priority from myTable where Description like @search -- exact matches have highest priority union select myTableid, @highestPriority-1 as Priority from myTable where Description like @search + '%' -- then with something at the end union select myTableid, @highestPriority-2 as Priority from myTable where Description like '%' + @search -- then with something at the beginning union select myTableid, @highestPriority-3 as Priority from myTable where Description like '%' + @search + '%' -- then if the Word falls somewhere in between ), splitWordMatches as( -- give each searched Word a rank based on its position in the searched string -- and calculate its char index in the field to search select myTable.myTableid, (@countOfSearchStrings - s.wordRank) as Priority, s.Word, wordIndex = CHARINDEX(s.Word, myTable.Description) from myTable join @splitTable s on myTable.Description like '%'+ s.Word + '%' -- and not exists(select myTableid from plainMatches p where p.myTableId = myTable.myTableId) -- need not look into myTables that have already been found in plainmatches as they are highest ranked -- this one takes a long time though, so commenting it, will have no impact on the result ), matchingRowsWithAllWords as ( select myTableid, count(myTableid) as myTableCount from splitWordMatches group by(myTableid) having count(myTableid) = @countOfSearchStrings ) , -- trim off the CTE here if you don't care about the ordering of words to be considered for priority wordIndexRatings as( -- reverse the char indexes retrived above so that words occuring earlier have higher weightage -- and then normalize them to sequential values select s.myTableid, Priority, Word, ROW_NUMBER() over (partition by s.myTableid order by wordindex desc) as comparativeWordIndex from splitWordMatches s join matchingRowsWithAllWords m on s.myTableId = m.myTableId ) , wordIndexSequenceRatings as ( -- need to do this to ensure that if the same set of words from search string is found in two rows, -- their sequence in the field value is taken into account for higher priority select w.myTableid, w.Word, (w.Priority + w.comparativeWordIndex + coalesce(sequncedPriority ,0)) as Priority from wordIndexRatings w left join ( select w1.myTableid, w1.priority, w1.Word, w1.comparativeWordIndex, count(w1.myTableid) as sequncedPriority from wordIndexRatings w1 join wordIndexRatings w2 on w1.myTableId = w2.myTableId and w1.Priority > w2.Priority and w1.comparativeWordIndex>w2.comparativeWordIndex group by w1.myTableid, w1.priority,w1.Word, w1.comparativeWordIndex ) sequencedPriority on w.myTableId = sequencedPriority.myTableId and w.Priority = sequencedPriority.Priority ), prioritizedSplitWordMatches as ( -- this calculates the cumulative priority for a field value select w1.myTableId, sum(w1.Priority) as OverallPriority from wordIndexSequenceRatings w1 join wordIndexSequenceRatings w2 on w1.myTableId = w2.myTableId where w1.Word <> w2.Word group by w1.myTableid ), completeSet as ( select myTableid, priority from plainMatches -- get plain matches which should be highest ranked union select myTableid, OverallPriority as priority from prioritizedSplitWordMatches -- get ranked split Word matches (which are ordered based on Word rank in search string and sequence) ), maximizedCompleteSet as( -- set the priority of a field value = maximum priority for that field value select myTableid, max(priority) as Priority from completeSet group by myTableId ) select priority, myTable.myTableid , code, Description from maximizedCompleteSet m join myTable on m.myTableId = myTable.myTableId order by Priority desc, Description -- order by priority desc to get highest rated items on top --offset 0 rows fetch next 50 rows only -- optional paging